Abstract In this work, we present a generalized methodology for analyzing
the convergence of quasi-optimal Taylor and Legendre approximations,
applicable to a wide class of parameterized elliptic PDEs with finite-dimensional
deterministic and stochastic inputs. Such methods construct an optimal index
set that corresponds to the sharp estimates of the polynomial coefficients. Our
analysis furthermore represents a new approach for estimating best M-term
approximation errors by means of coefficient bounds, without using the Stechkin
inequality. The framework we propose for analyzing asymptotic truncation
errors is based on an extension of the underlying multi-index set into a
continuous domain, and then an approximation of the cardinality (number of
integer multi-indices) by its Lebesgue measure. Several types of isotropic and
anisotropic (weighted) multi-index sets are explored, and rigorous proofs reveal
sharp asymptotic error estimates in which we achieve sub-exponential
convergence rates (of the form $M\exp(-(\kappa M)^{1/N})$, with $\kappa$ a constant
depending on the shape and size of multi-index sets) with respect to the total
number of degrees of freedom. Through several theoretical examples, we explicitly
derive the constant $\kappa$ and use the resulting sharp bounds to illustrate the
effectiveness of Legendre over Taylor approximations, as well as compare our rates
of convergence with current published results. Computational evidence
complements the theory and shows the advantage of our generalized framework
compared to previously developed estimates.


1 Introduction 

This paper focuses on a relevant model boundary value problem, involving
the simultaneous solution of a family of equations, parameterized by a
finite-dimensional vector $y = (y_1,\ldots,y_N) \in \Gamma = \prod_{i=1}^N \Gamma_i$, on a bounded
Lipschitz domain $D \subset \mathbb{R}^d$, $d \in \{1,2,3\}$. In particular, we consider a
differential operator $\mathcal{L}$ defined on $D$, and let $a(x,y)$, with $x \in D$ and $y \in \Gamma$,
represent the input coefficient associated with the operator $\mathcal{L}$. The forcing
term $f = f(x) \in L^2(D)$ is assumed to be a fixed function of $x \in D$. We
concentrate on the following parameterized boundary value problem: for all
$y \in \Gamma$, find $u(\cdot,y) : \overline{D} \to \mathbb{R}$ such that the following equation holds

$$\mathcal{L}(a(\cdot,y))[u(\cdot,y)] = f(\cdot) \quad \text{in } D, \qquad (1.1)$$

subject to suitable (possibly parameterized) boundary conditions. We require
$a$ and $f$ to be chosen such that system (1.1) is well-posed in a Banach space,
with unique solution $u$, such that, when suppressing the explicit dependence
on $x$, the map $y \mapsto u(y)$ is defined from the parameter domain $\Gamma$ into the
solution space $V(D)$.

Problems such as (1.1) arise in contexts of both deterministic and stochastic
modeling. In the deterministic setting, the parameter vector $y$ is known
or controlled by the user, and a typical goal is to study the dependence of $u$
on these parameters, e.g., optimizing an output of the equation with respect
to $y$ (see [10,30] for more details). On the other hand, stochastic modeling is
motivated by many engineering and science problems in which the input data
is not known exactly. A quantification of the effect of the input uncertainties
on the output of simulations is necessary to obtain a reliable prediction of the
physical system. A natural way to incorporate the presence of input
uncertainties into the governing model (1.1) is to consider the parameters
$\{y_n(\omega)\}_{n=1}^N$ as random variables and $y(\omega) : \Omega \to \Gamma$ a random vector, where
$\omega \in \Omega$ and $\Omega$ is the set of outcomes. In this setting, we assume the components
of $y$ have a joint probability density function (PDF) $\varrho : \Gamma \to \mathbb{R}_+$, with
$\varrho \in L^\infty(\Gamma)$ known directly through, e.g., truncations of correlated random
fields [20,28,29,36], such that the probability space is equivalent to
$(\Gamma, \mathcal{B}(\Gamma), \varrho(y)dy)$, where $\mathcal{B}(\Gamma)$ denotes the Borel $\sigma$-algebra on $\Gamma$ and
$\varrho(y)dy$ is the probability measure of $y$.

Monte Carlo (MC) methods (see, e.g., [18]) are the most popular approaches
for approximating high-dimensional integrals, such as expectation or
two-point correlation, based on independent realizations $u(y_k)$, $k = 1,\ldots,M$,
of the solution to (1.1); approximations of the expectation or other QoIs are
obtained by averaging over the corresponding realizations of that quantity.
The resulting numerical error is proportional to $M^{-1/2}$, thus achieving
convergence rates independent of dimension but requiring a very large number
of samples to achieve reasonably small errors. Moreover, MC methods do not
have the ability to simultaneously approximate the solution map $y \mapsto u(y)$,
since they are quadrature techniques and do not exploit the fact that, in many
scenarios, the solution depends smoothly on the coefficient $a$. Taking this smooth
dependence into account, several global polynomial approximation techniques,
for instance, intrusive Galerkin methods [3,35] and non-intrusive collocation
methods [2,32], have been proposed, often featuring much faster convergence
rates.
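
To make the $M^{-1/2}$ behaviour concrete, the following is a minimal sketch rather than a PDE solver: it replaces the solution realizations by a hypothetical scalar quantity of interest `toy_qoi` (an assumption made purely for illustration, with exact mean 1/3 under the uniform density on $[-1,1]^N$) and compares the observed sampling error with $M^{-1/2}$.

```python
import math
import random


def mc_mean(q, n_samples, dim, rng):
    """Average q(y) over n_samples independent draws y ~ Uniform([-1, 1]^dim)."""
    total = 0.0
    for _ in range(n_samples):
        y = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
        total += q(y)
    return total / n_samples


def toy_qoi(y):
    # Hypothetical stand-in for a quantity of interest of u(y);
    # its exact mean under the uniform density is 1/3 in any dimension.
    return sum(v * v for v in y) / len(y)


if __name__ == "__main__":
    rng = random.Random(0)
    for m in (100, 1600, 25600):
        err = abs(mc_mean(toy_qoi, m, 3, rng) - 1.0 / 3.0)
        # The error fluctuates from run to run but scales like M^(-1/2).
        print(m, err, 1.0 / math.sqrt(m))
```

The sample sizes grow by a factor of 16 so that the reference value $M^{-1/2}$ drops by a factor of 4 per row; the observed error should follow that trend only on average.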

Let $\mathcal{S} = \{\nu = (\nu_i)_{1\le i\le N} : \nu_i \in \mathbb{N}\}$. Global polynomial approximation
methods seek to build an approximation $u_\Lambda$ to the solution $u$ of the form:

$$u_\Lambda(x,y) = \sum_{\nu\in\Lambda} c_\nu(x)\Psi_\nu(y), \qquad (1.2)$$

for a finite multi-index set $\Lambda \subset \mathcal{S}$, where $\Psi_\nu$ is a multivariate polynomial in
$\mathrm{span}\{y^\mu : \mu \le \nu\}$ for $\nu \in \Lambda$ and $c_\nu \in V(D)$ is the coefficient to be computed,
both of which are method specific. Here, for two vectors $\nu, \mu \in \mathcal{S}$, we say
$\mu \le \nu$ if and only if $\mu_i \le \nu_i$ for all $1 \le i \le N$. Also, given $\alpha = (\alpha_i)_{1\le i\le N}$
a vector of real numbers, we define $\alpha^\nu = \prod_{1\le i\le N}\alpha_i^{\nu_i}$ with the convention
$0^0 := 1$. We will often suppress the dependence on $x$ and use the notations
$u(y) := u(\cdot,y)$ and $a(y) := a(\cdot,y)$ without loss of generality. In this paper,
we are interested in solving (1.1) using a class of polynomial approximations
based on the Taylor and Legendre expansions of the solution $u$. The polynomial
basis considered herein is thus given by the monomials $\Psi_\nu(y) = y^\nu$ (in the
former case) and the tensorized Legendre polynomials $\Psi_\nu(y) = L_\nu(y)$ (in the
latter case).

The evaluation of $u_\Lambda$ in (1.2) requires the computation of $\#(\Lambda)$ coefficients
$c_\nu(x) \in V(D)$, where $\#(\Lambda)$ is the cardinality of $\Lambda$. A naive choice of $\Lambda$ and the
corresponding polynomial space $\mathbb{P}_\Lambda(\Gamma) = \mathrm{span}\{\Psi_\nu(y), \nu \in \Lambda\}$, for instance,
tensor product polynomial spaces, could lead to an infeasible computational
cost, especially when the dimension of the parameter domain is high. It is
important to be able to construct the set of the most effective indices for the
approximation (1.2), which provides maximum accuracy for a given cardinality.
In other words, given a fixed $M \in \mathbb{N}$, one searches for a set $\Lambda$ which minimizes
the error $u - \sum_{\nu\in\Lambda} c_\nu\Psi_\nu$ among all index subsets of $\mathcal{S}$ of cardinality $M$. This
practice is known as best M-term approximation.

The literature on the best M-term Taylor and Galerkin approximations has
grown quickly in recent years; we refer to [6,7,9,11-14,22,23,25].
In the benchmark work [14], the analytic dependence of the solutions of
parametric elliptic PDEs on the parameters was proved under mild assumptions
on the input coefficients, and convergence analysis of the best M-term
Taylor and Legendre approximations was established subsequently. Consider, for
example, the expansion of $u$ on $\Gamma = [-1,1]^N$ by a family of $L^\infty$-normalized
polynomials, i.e., $\|\Psi_\nu\|_{L^\infty(\Gamma)} = 1$. Application of the triangle inequality yields

$$\Big\|u - \sum_{\nu\in\Lambda} c_\nu\Psi_\nu\Big\|_{L^\infty(\Gamma, V(D))} \le \sum_{\nu\in\Lambda^c}\|c_\nu\|_{V(D)}, \qquad (1.3)$$

which suggests determining the optimal index set $\Lambda_M^{best}$ by choosing the set
of indices $\nu$ corresponding to the $M$ largest $\|c_\nu\|_{V(D)}$. Here, $\Lambda^c$ denotes the
complement of $\Lambda$ in $\mathcal{S}$. In [14], the error of such approximation was estimated
by means of the Stechkin inequality (see, e.g., [16]) such that

$$\sum_{\nu\in(\Lambda_M^{best})^c}\|c_\nu\|_{V(D)} \le \big\|(\|c_\nu\|_{V(D)})\big\|_{\ell^p(\mathcal{S})}\, M^{1-\frac{1}{p}}, \qquad (1.4)$$

where $p$ is some number in $(0,1)$ such that $(\|c_\nu\|_{V(D)})_{\nu\in\mathcal{S}}$ is $\ell^p$-summable.
It should be noted that the convergence rate (1.4) does not depend on the
dimension of the parameter domain $\Gamma$ (which is possibly countably infinite
therein). This error estimate, however, has some limitations. First, explicit
evaluation of the coefficient $\|(\|c_\nu\|_{V(D)})\|_{\ell^p(\mathcal{S})}$ is inaccessible in general (thus
so is the total estimate). Secondly, (1.4) often holds for infinitely many
values of $p$, and stronger rates, corresponding to smaller $p$, are also attached to
bigger coefficients. For a specific range of $M$, the effective rate of convergence
is unclear. In implementation, finding the best index set and polynomial space
is an infeasible task, since this requires computation of all of the $c_\nu$. As a
strategy to circumvent this challenge, adaptive algorithms which generate the
index set in a near optimal, greedy procedure were developed in [11]. This
method, however, comes with a high cost of exploring the polynomial space,
which may be daunting in high-dimensional problems.
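
The Stechkin-type bound (1.4) can be checked numerically on a finite sequence. The sketch below is only an illustration under stated assumptions: the sequence $k^{-2}$ and the value $p = 0.75$ are hypothetical choices, not coefficients of any PDE solution, and `stechkin_check` is a helper name introduced here.

```python
def stechkin_check(coeffs, m, p):
    """Return (tail, bound): the sum of all but the m largest |c_k|, and the
    Stechkin-type bound ||c||_{l^p} * m^(1 - 1/p), for a finite sequence."""
    ordered = sorted((abs(c) for c in coeffs), reverse=True)
    tail = sum(ordered[m:])
    lp_norm = sum(c ** p for c in ordered) ** (1.0 / p)
    bound = lp_norm * m ** (1.0 - 1.0 / p)
    return tail, bound


if __name__ == "__main__":
    # Hypothetical, algebraically decaying coefficient magnitudes.
    coeffs = [k ** -2.0 for k in range(1, 5001)]
    for m in (10, 100, 1000):
        tail, bound = stechkin_check(coeffs, m, p=0.75)
        print(m, tail, bound)
```

Comparing the two printed columns gives a feel for how much slack a generic bound of this type can leave for a specific decay profile and a fixed $p$.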

Instead of building the index set based on exact values of the polynomial
coefficients $c_\nu$, an attractive alternative approach (referred to as quasi-optimal
approximation throughout this paper) is to establish sharp upper bounds of
$c_\nu$ (by a priori or a posteriori methods), and then construct the index set
$\Lambda_M^{qsi}$ corresponding to the $M$ largest such bounds. For this strategy, the main
computational work for the selection of the (near) best terms reduces to
determining sharp coefficient estimates, which is expected to be significantly
cheaper than exact calculations. Quasi-optimal polynomial approximation has
been performed for some parametric elliptic models with optimistic results:
while the upper bounds of $\|c_\nu\|_{V(D)}$ (denoted from now on by $B(\nu)$) were
computed with a negligible cost, the method was comparably as accurate as the best
M-term approach, as shown in [6,7]. The first rigorous numerical analysis
of quasi-optimal approximation was presented in [6] for $B(\nu) = \rho^{-\nu}$ with $\rho$
being a vector $(\rho_i)_{1\le i\le N}$ with $\rho_i > 1$ $\forall i$. In that work, the asymptotic
sub-exponential convergence rate was proved based on optimizing the Stechkin
estimation. Briefly, the analysis applied the Stechkin inequality to yield

$$\sum_{\nu\notin\Lambda_M^{qsi}} B(\nu) \le \|B(\nu)\|_{\ell^p(\mathcal{S})}\, M^{1-\frac{1}{p}}, \qquad (1.5)$$

then took advantage of the formula of $B(\nu)$ to compute $p \in (0,1)$, depending
on $M$, which minimizes $\|B(\nu)\|_{\ell^p(\mathcal{S})} M^{1-\frac{1}{p}}$.
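
As an illustration of how a quasi-optimal index set can be assembled from a bound of the form $B(\nu) = \rho^{-\nu}$, here is a minimal sketch. It is not the algorithm of [6]; it is a generic heap-based search (the function name `quasi_optimal_set` is introduced here) that relies only on the assumption that $B$ is strictly decreasing in every coordinate, so each index is reached after all of its predecessors.

```python
import heapq
from math import prod


def quasi_optimal_set(rho, m):
    """Return the m multi-indices with the largest bounds B(nu) = rho^(-nu),
    in decreasing order of B (ties broken by tuple order)."""
    n_dim = len(rho)

    def bound(nu):
        return prod(r ** (-k) for r, k in zip(rho, nu))

    start = (0,) * n_dim
    heap = [(-bound(start), start)]  # max-heap via negated bounds
    seen = {start}
    selected = []
    while heap and len(selected) < m:
        _, nu = heapq.heappop(heap)
        selected.append(nu)
        # Forward neighbours nu + e_i are the only new candidates.
        for i in range(n_dim):
            nxt = nu[:i] + (nu[i] + 1,) + nu[i + 1:]
            if nxt not in seen:
                seen.add(nxt)
                heapq.heappush(heap, (-bound(nxt), nxt))
    return selected


if __name__ == "__main__":
    # Hypothetical anisotropic weights rho = (2, 3).
    print(quasi_optimal_set((2.0, 3.0), 5))
```

The search visits only indices on or next to the selected set, so its cost grows with $M$ and $N$ rather than with the size of a bounding tensor grid.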

Although known as an essential tool to study the convergence rate of best
M-term approximations, the Stechkin inequality is probably less efficient for
quasi-optimal methods. As a generic estimate, it does not fully exploit the available
information on the decay of the coefficient bounds. In such a setting, a direct
estimate of $\sum_{\nu\notin\Lambda_M^{qsi}} B(\nu)$ may be viable and advantageous to provide an
improved result. In addition, the process of solving the minimization problem
$p^* = \mathrm{argmin}_{p\in(0,1)} \|B(\nu)\|_{\ell^p(\mathcal{S})} M^{1-\frac{1}{p}}$ needs to be tailored to $B(\nu)$, making this
approach not ideal for generalization. Currently, this minimization approach
has been limited to some quite simple types of upper bounds. In many
scenarios, the sharp estimates of the coefficients may involve complicated bounds
which are not even explicitly computable, such as those proposed in [14]. The
extension of this approach to such cases seems to be impossible.

In this work, we present a generalized methodology for convergence
analysis of quasi-optimal polynomial approximations for parameterized PDEs with
deterministic and stochastic coefficients. As the errors of best M-term
approximations are bounded by those of quasi-optimal methods,

$$\sum_{\nu\notin\Lambda_M^{best}}\|c_\nu\|_{V(D)} \le \sum_{\nu\notin\Lambda_M^{qsi}}\|c_\nu\|_{V(D)} \le \sum_{\nu\notin\Lambda_M^{qsi}} B(\nu), \qquad (1.6)$$

our analysis also gives accessible estimates (i.e., estimates depending only on
known or computable terms) for best M-term approximation errors under
several established properties on the decay of the polynomial coefficients.
These are sharp explicit theoretical estimates in the scenario that: 1) the
triangle inequality (1.3) has to be employed; 2) one has to evaluate $\|c_\nu\|_{V(D)}$ via their
bounds $B(\nu)$, in which case (1.6) represents the smallest accessible bound of
$\sum_{\nu\in(\Lambda_M^{best})^c}\|c_\nu\|_{V(D)}$. We particularly focus on elliptic equations where the
input coefficient depends affinely and non-affinely on the parameters (see
Section 2). However, since our error analysis only depends on the coefficient upper
bounds, we expect that the methods and results presented herein can be
applied to other, more general model problems with finite parametric dimension,
including nonlinear elliptic PDEs, initial value problems and parabolic
equations [12,22,23,25]. Our approach seeks a direct estimate of $\sum_{\nu\notin\Lambda_M^{qsi}} B(\nu)$
without using the Stechkin inequality. It involves a partition of $B((\Lambda_M^{qsi})^c)$
into a family $(I_j)_j$ of small positive real intervals and the corresponding
splitting of $(\Lambda_M^{qsi})^c$ into disjoint subsets $Q_j$ of indices $\nu$, such that $B(\nu) \in I_j$.
Under this process, the truncation error can be bounded as

$$\sum_{\nu\notin\Lambda_M^{qsi}} B(\nu) = \sum_j\sum_{\nu\in Q_j} B(\nu) \le \sum_j \#(Q_j)\cdot\max(I_j),$$

and therefore, the quality of the error estimate mainly depends on the
approximation of the cardinality of $Q_j$. To tackle this problem, we develop a strategy
which extends $Q_j$ into a continuous domain and, through relating the number
of $N$-dimensional lattice points to continuous volume (Lebesgue measure),
establishes a sharp estimate of the cardinality $\#(Q_j)$ up to any prescribed
accuracy. This development includes the utilization and extension of several
results on lattice point enumeration; for a survey we refer to [8,21]. Under
some weak assumptions on $B(\nu)$ (which are satisfied by all existing coefficient
estimates we are aware of), we achieve an asymptotic sub-exponential
convergence rate of truncation error of the form $M\exp(-(\kappa M)^{1/N})$, where $\kappa$ is a
constant depending on the shape and size of quasi-optimal index sets. Through
several examples, we explicitly derive $\kappa$ and demonstrate the optimality of our
estimate both theoretically (by proving a lower bound) and computationally
(via comparison with exact calculation of truncation error). The advantage
of our analysis framework is therefore twofold. First, it applies to a general
class of coefficient decay (and correspondingly, quasi-optimal and best M-term
approximations); and second, it yields sharp estimates of the asymptotic
convergence rates. For convenience, for the rest of this paper, we will drop the
superscript qsi and simply refer to the quasi-optimal set of cardinality $M$ as
$\Lambda_M$.

Our paper is organized as follows. In Section 2, we describe the elliptic
equations with parameterized input coefficient and the necessary mathematical
notations. In Section 3, we present the analyticity of the solution $u$ with
respect to the parameters and derive coefficient estimates of Taylor and Legendre
expansions of $u$. The advantage of Legendre over Taylor expansions will also
be discussed. Our main results on the convergence analysis for a general class
of multi-indexed series are established in Section 4. By means
of these results, we give accessible asymptotic error estimates of several
quasi-optimal and best M-term polynomial approximations in Section 5. Finally,
Section 6 is devoted to further discussions on the error lower bound, as well
as the pre-asymptotic estimate in a simplified case.


2 Problem setting 

We consider solving simultaneously the following parameterized linear, elliptic
PDE:

$$\begin{cases} -\nabla\cdot(a(x,y)\nabla u(x,y)) = f(x), & \forall (x,y) \in D\times\Gamma, \\ u(x,y) = 0, & \forall (x,y) \in \partial D\times\Gamma, \end{cases} \qquad (2.1)$$

on a bounded Lipschitz domain $D \subset \mathbb{R}^d$ with the coefficient $a(\cdot,y)$ defined
on $\Gamma = \prod_{i=1}^N \Gamma_i \subset \mathbb{R}^N$ with $\Gamma_i = [-1,1]$, $\forall i \in \{1,\ldots,N\}$. We require the
following assumption:

Assumption 1 (Continuity and coercivity) There exist constants
$0 < a_{\min} \le a_{\max}$ such that for all $x \in D$ and $y \in \Gamma$

$$a_{\min} \le a(x,y) \le a_{\max}.$$

The Lax-Milgram lemma ensures the existence and uniqueness of the solution $u$
in $V(D)\otimes L^2_\varrho(\Gamma)$, where $V(D) = H_0^1(D)$ and $L^2_\varrho(\Gamma)$ is the space of square
integrable functions on $\Gamma$ with respect to the measure $\varrho(y)dy$ with
$\varrho(y) = \prod_{i=1}^N \varrho_i(y_i) = \frac{1}{2^N}$, $\forall y \in \Gamma$. This setting represents parametric elliptic models
as well as stochastic models with bounded support random coefficient. We
denote $V^*(D) = H^{-1}(D)$ and, without loss of generality, assume $a_{\min} = 1$ in
this work.
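The corresponding weak formulation for (2.1) is written as follows: find
$u(x,y) \in V(D)\otimes L^2_\varrho(\Gamma)$ such that

$$\int_\Gamma\int_D a(x,y)\nabla u(x,y)\cdot\nabla v(x,y)\,\varrho(y)\,dx\,dy = \int_\Gamma\int_D f(x)v(x,y)\,\varrho(y)\,dx\,dy \quad \forall v \in V(D)\otimes L^2_\varrho(\Gamma). \qquad (2.2)$$
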

Following the arguments in [14], we derive the convergence of Taylor and
Legendre approximations based on the analyticity of the solution on complex
domains. Here, the convergence is proved under the affine parameter
dependence of diffusion coefficients for the Taylor series, but we relax this
assumption for the Legendre series. More specifically, we only assume a holomorphic
extension $a(x,z)$ of $a(x,y)$ for the complex variable $z = (z_1,\ldots,z_N) \in \mathbb{C}^N$.

Assumption 2 (Holomorphic parameter dependence) The complex
continuation of $a$, represented as the map $a : \mathbb{C}^N \to L^\infty(D)$, is an $L^\infty(D)$-valued
holomorphic function on $\mathbb{C}^N$.

This condition is easily fulfilled with $a(x,y)$ consisting of polynomials,
exponential, sine and cosine functions of the variables $y_1,\ldots,y_N$. Below, we give
some examples of diffusion coefficients which can be accommodated in our
framework. The rigorous proofs and discussion on the advantage of Legendre
over Taylor approximations will be postponed to the next section.

Example 1. For the input coefficient depending affinely on the parameters,
i.e.,

$$a(x,y) = a_0(x) + \sum_{i=1}^N y_i\psi_i(x), \quad x \in D,\ y \in \Gamma,$$

where $a_0 \in L^\infty(D)$, $(\psi_i)_{1\le i\le N} \subset L^\infty(D)$ such that $a$ satisfies Assumption 1;
both Taylor and Legendre series approximations of $u(y)$ to (2.1) converge.

Example 2. Consider the input coefficient defined as 



with $a_0 \in L^\infty(D)$, $a_0(x) \ge a_{\min} > 0$ $\forall x \in D$ and $(\psi_i)_{1\le i\le N} \subset L^\infty(D)$. It is
easy to see that $a(x,y)$ satisfies Assumptions 1-2. Thus, the Legendre series
approximation of $u(y)$ to (2.1) converges for this model.

Example 3. Consider the input coefficient defined as 



with $a_0 \in L^\infty(D)$, $a_0(x) \ge 0$ $\forall x \in D$ and $(\psi_i)_{1\le i\le N} \subset L^\infty(D)$. We have that
$a(x,y)$ satisfies Assumptions 1-2 and the Legendre series approximation of $u(y)$
to (2.1) converges.



Another framework for establishing the convergence of Legendre series was
presented in [12] and applied to a more general setting of parametric PDEs
(non-elliptic, infinite dimensional noise and non-affine dependence on
parameters). This approach imposes analyticity assumptions on the solution, which
require nontrivial validation in practice. Instead, in this work, we focus on
elliptic equations, which allows us to derive concise, minimal assumptions on
the input coefficient (as seen above), under which the convergence of
Legendre approximations holds straightforwardly. It is also worth recalling that
our error estimates only depend on the sharp upper bound of the polynomial
coefficients. Therefore, while not studied herein, PDE models covered
by [12,22,23,25], which give rise to the same types of coefficient bounds as those
considered in Section 5, can be treated by the forthcoming analysis.


3 Analyticity of the solutions and estimates of the polynomial 
coefficients 

Loosely speaking, the coefficients of Taylor and Legendre expansions can be
estimated via three steps:

1. Extending the uniform ellipticity of $a$ from $\Gamma$ to certain polydiscs/polyellipses
in $\mathbb{C}^N$;

2. Proving the analyticity of the solution on those extended domains; and

3. Estimating the expansion coefficients using the analyticity properties and
Cauchy's integral formula.

We will discuss each step in the next subsections. By $\Re(z)$ and $\Im(z)$, we
denote the real and imaginary parts of a complex number $z$.


3.1 Complex uniform ellipticity 

The convergence of Taylor approximations is proved using the uniform
ellipticity of the input coefficient in polydiscs containing $\Gamma$, based on a complex
analysis argument.

Definition 1 For $0 < \delta < a_{\min}$ and $\rho$ denoting the vector $(\rho_i)_{1\le i\le N}$ with
$\rho_i > 1$ $\forall i$, we say $a(x,y)$ satisfies the $(\delta,\rho)$-polydisc uniform ellipticity assumption
(referred to as DUE($\delta$, $\rho$)) if there holds

$$\Re(a(x,z)) \ge \delta$$

for all $x \in D$ and all $z = (z_i)_{1\le i\le N}$ contained in the polydisc

$$\mathcal{O}_\rho = \bigotimes_{1\le i\le N}\{z_i \in \mathbb{C} : |z_i| \le \rho_i\}.$$

At the same time, Legendre expansions require the uniform ellipticity in
smaller complex domains: the polyellipses.
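Definition 2 For $0 < \delta < a_{\min}$ and $\rho$ denoting the vector $(\rho_i)_{1\le i\le N}$ with
$\rho_i > 1$ $\forall i$, we say $a(x,y)$ satisfies the $(\delta,\rho)$-polyellipse uniform ellipticity assumption
(referred to as EUE($\delta$, $\rho$)) if there holds

$$\Re(a(x,z)) \ge \delta$$

for all $x \in D$ and all $z = (z_i)_{1\le i\le N}$ contained in the polyellipse

$$\mathcal{E}_\rho = \bigotimes_{1\le i\le N}\mathcal{E}_{\rho_i},$$

where $\mathcal{E}_{\rho_i} \subset \mathbb{C}$ is the region bounded by the ellipse with foci $\pm 1$ and
semi-axes $\frac{\rho_i + \rho_i^{-1}}{2}$ and $\frac{\rho_i - \rho_i^{-1}}{2}$.
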

A close look at DUE and EUE reveals the advantage of Legendre over
Taylor expansions. The polyellipses $\mathcal{E}_\rho$ extend the real domain $\Gamma$ in a
continuous manner, so that if $\rho$ tends toward 1, $\mathcal{E}_\rho$ shrinks to $\Gamma$. Thus, one may hope
that the uniform ellipticity property of $a(x,y)$ in $\Gamma$ (Assumption 1) carries
over to some polyellipses $\mathcal{E}_\rho$ (at least with $\rho$ close to 1). In fact, we prove that
the EUE property is a consequence of Assumptions 1 and 2.

Lemma 1 Let $a : \Gamma \to L^\infty(D)$ be a continuous function satisfying
Assumptions 1 and 2. Then, for all $\delta < a_{\min}$, there exists a vector $\rho = (\rho_i)_{1\le i\le N}$ with
$\rho_i > 1$ $\forall i$ such that EUE($\delta$, $\rho$) holds.

On the other hand, DUE always requires an extension of the coercive
property in $\Gamma$ to at least the unit polydisc, which is not possible
in general. For illustration, the sets of $z$ such that $\Re(a(x,z)) \ge \delta$ for all $x \in D$
with some fixed $\delta > 0$ (referred to as the domains of uniform ellipticity) are
plotted in Figure 1 for some typical 1-dimensional parametric coefficients. The
maximal ellipses and discs contained in these domains are shown. We observe
that for the affine coefficient, the set extends without restriction along the imaginary
axis, and discs covering $\Gamma$ can easily be placed inside. This highlights the success
of Taylor approximations for parameterized models which depend affinely on
the parameters [14,22,23]. This property, however, no longer holds for
non-affine, yet holomorphic diffusion coefficients. Taylor approximations for these
cases can be treated by a real analysis approach, but under additional strong
constraints, see [7].
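
The contrast between discs and ellipses can be reproduced on a toy case. The sketch below assumes a hypothetical 1-d coefficient $a(z) = 1 + z^2$ with the $x$-dependence suppressed; it is not one of the coefficients plotted in Figure 1. Since $\Re(a)$ is harmonic, its minimum over a closed disc or ellipse is attained on the boundary, so sampling the boundary curve is enough up to discretization error.

```python
import cmath
import math


def min_real_part_on_curve(a, points):
    """Smallest value of Re(a(z)) over a list of sample points z."""
    return min(a(z).real for z in points)


def circle(radius, n=4000):
    """Sample points on the circle |z| = radius."""
    return [radius * cmath.exp(2j * math.pi * k / n) for k in range(n)]


def bernstein_ellipse(rho, n=4000):
    """Sample points on the ellipse with foci -1, +1 and semi-axes
    (rho + 1/rho)/2 and (rho - 1/rho)/2."""
    ax = 0.5 * (rho + 1.0 / rho)
    bx = 0.5 * (rho - 1.0 / rho)
    return [complex(ax * math.cos(t), bx * math.sin(t))
            for t in (2.0 * math.pi * k / n for k in range(n))]


def a_toy(z):
    # Hypothetical non-affine coefficient; a(y) >= 1 for real y in [-1, 1].
    return 1.0 + z * z


if __name__ == "__main__":
    # Already on the unit circle Re(a) drops to 0 (at z = i), so no disc
    # containing [-1, 1] admits a positive lower bound delta.
    print(min_real_part_on_curve(a_toy, circle(1.0)))
    # On a thin ellipse the minimum 1 - b^2 stays positive.
    print(min_real_part_on_curve(a_toy, bernstein_ellipse(1.5)))
```

For this toy coefficient the ellipse minimum equals $1 - b^2$ with $b = (\rho - \rho^{-1})/2$, so ellipses with $b \le \sqrt{1-\delta}$ are admissible for a given $\delta < 1$, whereas no disc is.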

We close this subsection with a proof of Lemma 1. 

Proof (of Lemma 1). Since $a(z)$ is holomorphic in $\mathbb{C}^N$, $\Re(a(z))$ is
a continuous mapping. By the Heine-Cantor theorem, $\Re(a(z))$ is uniformly
continuous in any compact subset of $\mathbb{C}^N$. Fixing $0 < \delta < a_{\min}$, without loss
of generality, we can choose $\xi > 0$ such that $\forall z \in \mathbb{C}^N$, $\forall z' \in \Gamma$ satisfying
$\|z - z'\| < \xi$ there holds

$$\|\Re(a(z)) - \Re(a(z'))\|_{L^\infty(D)} \le a_{\min} - \delta.$$

This implies

$$\Re(a(x,z)) \ge \delta - a_{\min} + \Re(a(x,z')) \ge \delta,$$

for all $x \in D$, $z \in \mathbb{C}^N$ such that $\|z - z'\| < \xi$ with some $z' \in \Gamma$. Denoting
$\Gamma_\xi = \{z \in \mathbb{C}^N : \mathrm{dist}(z,\Gamma) < \xi\}$, we proceed to prove there exists $\rho = (\rho_i)_{1\le i\le N}$
with $\rho_i > 1$ $\forall i$ such that the polyellipse $\mathcal{E}_\rho$ is included in $\Gamma_\xi$.

Fig. 1: Domains of uniform ellipticity for some 1-d parametric input coefficients (indicated
in gray). The yellow lines represent the interval [-1,1]. The blue and red curves are the
maximal discs and ellipses which can be contained in those domains, respectively.

First, consider the “polyrectangle” 



we will show that $S \subset \Gamma_\xi$. Indeed, for every $z \in S$, choose $z' = (z'_i)_{1\le i\le N}$ as
follows: if $|\Re(z_i)| \le 1$, then $z'_i = \Re(z_i)$; otherwise, $z'_i = \mathrm{sgn}(\Re(z_i))$. It is easy
to see that $z' \in \Gamma$. Furthermore, for all $i \in \{1,\ldots,N\}$,



Thus, |zi — z'l < and we have 



This gives $z \in \Gamma_\xi$ and $S \subset \Gamma_\xi$.

It remains to find $\rho$ satisfying $\mathcal{E}_\rho \subset S$. To make this hold, we only need to
select $\rho$ such that the lengths of the axes of each ellipse are less than the lengths
of the corresponding sides of the rectangle, i.e.,



The choice of $\rho_i = \ldots > 1$ fulfills this condition, with which
$\mathcal{E}_\rho \subset S \subset \Gamma_\xi$. There follows

$$\Re(a(x,z)) \ge \delta$$

for all $x \in D$, $z \in \mathcal{E}_\rho$ and $a$ satisfies EUE($\delta$, $\rho$), as desired.

□
3.2 Analyticity of the solutions with respect to the parameters


If DUE/EUE holds, according to the Lax-Milgram theorem, $u(z) \in V(D)$
is defined and uniformly bounded in certain polydiscs $\mathcal{O}_\rho$/polyellipses $\mathcal{E}_\rho$
containing $\Gamma$. Exploiting this fact and the analyticity of $a(z)$ in $\mathbb{C}^N$, we establish
the analyticity of the map $z \mapsto u(z)$. The results given in this section
essentially follow those in Section 2.1 of [14], but apply to the more general cases
of smooth, non-affine diffusion coefficients.


Theorem 1 Assume that the coefficient $a(x,y)$ satisfies Assumptions 1-2. If
DUE($\delta$, $\rho$) (EUE($\delta$, $\rho$) correspondingly) holds for some $0 < \delta < a_{\min}$ and
$\rho = (\rho_i)_{1\le i\le N}$ with $\rho_i > 1$ $\forall i$, then the function $z \mapsto u(z)$ is holomorphic in
an open neighborhood of the polydisc $\mathcal{O}_\rho$ (the polyellipse $\mathcal{E}_\rho$ correspondingly).

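Proof. We will prove this theorem for $a(x,y)$ satisfying DUE($\delta$, $\rho$). The
other case follows similarly. Defining

$$\mathcal{A} = \Big\{z \in \mathbb{C}^N : \Re(a(x,z)) \ge \frac{\delta}{2} \text{ for all } x \in D\Big\},$$

the proof consists of two steps showing that

1. $\mathrm{int}(\mathcal{A})$ is an open neighborhood of the polydisc $\mathcal{O}_\rho$; and

2. the map $z \mapsto u(z)$ is holomorphic in $\mathrm{int}(\mathcal{A})$.

Here, $\mathrm{int}(\mathcal{A})$ is the interior of $\mathcal{A}$.

First, let us choose an arbitrary element $\hat{z}$ in $\mathcal{O}_\rho$. By $B(\hat{z},r)$, we denote
the open ball of radius $r$ centered at $\hat{z}$ in $\mathbb{C}^N$. Observing that the map $z \mapsto a(z)$
is holomorphic in $\mathbb{C}^N$, we have that $z \mapsto \Re(a(z))$ is a continuous function in $\mathbb{C}^N$.
There exists $r_{\hat{z}} > 0$ depending on $\hat{z}$ such that for all $z \in B(\hat{z},r_{\hat{z}})$,

$$\|\Re(a(z)) - \Re(a(\hat{z}))\|_{L^\infty(D)} \le \frac{\delta}{2}.$$

This gives

$$\Re(a(x,z)) \ge \Re(a(x,\hat{z})) - \frac{\delta}{2} \ge \frac{\delta}{2}, \quad \forall x \in D,\ z \in B(\hat{z},r_{\hat{z}}),$$

and $B(\hat{z},r_{\hat{z}}) \subset \mathcal{A}$ for all $\hat{z} \in \mathcal{O}_\rho$. We obtain $\hat{z} \in \mathrm{int}(\mathcal{A})$ for all $\hat{z} \in \mathcal{O}_\rho$, which
concludes Step 1.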

As $a(z)$ is holomorphic in $\mathbb{C}^N$, for all $z \in \mathrm{int}(\mathcal{A})$, there exists

$$\partial_i a(z) := \lim_{h\to 0}\frac{a(z + he_i) - a(z)}{h} \in L^\infty(D),$$

where $i \in \{1,\ldots,N\}$ and $e_i$ denotes the Kronecker sequence with 1 at index
$i$ and 0 at other indices. The proof of Step 2 is then similar to that of Lemma
2.2 in [14] and is omitted here.

□
3.3 Estimates of the polynomial coefficients

Under the analyticity properties established in Theorem 1, the convergence of
Taylor and Legendre expansions of the solutions, as well as estimates of the
expansion coefficients, are well-studied, e.g., in [6,12,14]. In this subsection, we
review without proof those results in the context of finite-dimensional, possibly
non-affine parametric coefficients. Recall that $\mathcal{S} = \{\nu = (\nu_i)_{1\le i\le N} : \nu_i \in \mathbb{N}\}$.
For all $\nu \in \mathcal{S}$, we introduce the multivariate notations $|\nu| := \sum_{1\le i\le N}\nu_i$ and
$\nu! := \prod_{1\le i\le N}\nu_i!$, and define the partial derivative
$\partial^\nu u := \frac{\partial^{|\nu|}u}{\partial y_1^{\nu_1}\cdots\partial y_N^{\nu_N}}$.
The Taylor series of $u(y)$ reads

$$u(y) = \sum_{\nu\in\mathcal{S}} t_\nu y^\nu, \qquad (3.1)$$

where the coefficients $t_\nu \in V(D)$ are defined as

$$t_\nu := \frac{1}{\nu!}\partial^\nu u(0), \quad \nu \in \mathcal{S}.$$

The convergence of the Taylor expansion in $\Gamma$ and estimates of $\|t_\nu\|_{V(D)}$ are
given in the following.

Proposition 1 Assume that the coefficient $a(x,y)$ satisfies Assumptions 1-2.
If DUE($\delta$, $\rho$) holds for some $0 < \delta < a_{\min}$ and $\rho = (\rho_i)_{1\le i\le N}$ with $\rho_i > 1$ $\forall i$,
then the Taylor series $\sum_{\nu\in\mathcal{S}} t_\nu y^\nu$ converges uniformly towards $u(y)$ in $\Gamma$.

Furthermore, we have the estimate

$$\|t_\nu\|_{V(D)} \le \frac{\|f\|_{V^*(D)}}{\delta}\rho^{-\nu}. \qquad (3.2)$$
[Proof was removed.] 

On the other hand, the tensorized Legendre series of $u(y)$ is defined as

$$u(y) = \sum_{\nu\in\mathcal{S}} u_\nu P_\nu(y), \qquad (3.3)$$

where $P_\nu(y) = \prod_{i=1}^N P_{\nu_i}(y_i)$, with $P_{\nu_i}$ denoting the monodimensional Legendre
polynomials of degree $\nu_i$ according to the $L^\infty$ normalization
$\|P_{\nu_i}\|_{L^\infty([-1,1])} = P_{\nu_i}(1) = 1$.

A second type of Legendre expansion, which employs the $L^2_\varrho$-normalized
version of $P_\nu$, is also considered. Denote the multivariate polynomials
$L_\nu(y) = \prod_{i=1}^N L_{\nu_i}(y_i)$, with $L_{\nu_i}(y_i)$ given by

$$L_{\nu_i}(y_i) := \sqrt{2\nu_i + 1}\,P_{\nu_i}(y_i).$$
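
The two normalizations can be checked numerically in one dimension. The following sketch is an illustration only: it evaluates $P_n$ by the standard three-term recurrence and approximates the inner product with respect to the density $1/2$ on $[-1,1]$ by a composite Simpson rule; the helper names are introduced here and are not part of the paper.

```python
import math


def legendre_p(n, y):
    """P_n(y) via the three-term recurrence, normalised so that P_n(1) = 1."""
    p_prev, p_curr = 1.0, y
    if n == 0:
        return p_prev
    for k in range(1, n):
        # (k + 1) P_{k+1} = (2k + 1) y P_k - k P_{k-1}
        p_prev, p_curr = p_curr, ((2 * k + 1) * y * p_curr - k * p_prev) / (k + 1)
    return p_curr


def legendre_l(n, y):
    """L_n(y) = sqrt(2n + 1) P_n(y)."""
    return math.sqrt(2 * n + 1) * legendre_p(n, y)


def inner_product(f, g, n_intervals=2000):
    """Composite Simpson approximation of the integral of f*g*(1/2) over [-1, 1]."""
    h = 2.0 / n_intervals
    total = f(-1.0) * g(-1.0) + f(1.0) * g(1.0)
    for k in range(1, n_intervals):
        y = -1.0 + k * h
        total += (4 if k % 2 else 2) * f(y) * g(y)
    return 0.5 * total * h / 3.0


if __name__ == "__main__":
    same = inner_product(lambda y: legendre_l(3, y), lambda y: legendre_l(3, y))
    mixed = inner_product(lambda y: legendre_l(2, y), lambda y: legendre_l(4, y))
    print(same, mixed)  # close to 1 and 0 respectively
```

In $N$ dimensions the same identities hold factor by factor, since both the polynomials and the density are tensor products.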

The Legendre series in this case can be written as

$$u(y) = \sum_{\nu\in\mathcal{S}} v_\nu L_\nu(y). \qquad (3.4)$$

We note that the coefficients $u_\nu, v_\nu \in V(D)$ are defined by

$$v_\nu = \int_\Gamma u(y)L_\nu(y)\varrho(y)\,dy \quad \text{and} \quad u_\nu = \Big(\prod_{i=1}^N (2\nu_i + 1)\Big)^{1/2} v_\nu. \qquad (3.5)$$

The following proposition establishes estimates of $\|u_\nu\|_{V(D)}$, $\|v_\nu\|_{V(D)}$ and the
convergence of the Legendre expansions of $u$ in $\Gamma$.

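Proposition 2 Assume that the coefficient $a(x,y)$ satisfies Assumptions 1-2.
If EUE($\delta$, $\rho$) holds for some $0 < \delta < a_{\min}$ and $\rho = (\rho_i)_{1\le i\le N}$ with $\rho_i > 1$ $\forall i$,
then we have the estimates

$$\|u_\nu\|_{V(D)} \le C_{\rho,\delta}\,\rho^{-\nu}\prod_{i=1}^N (2\nu_i + 1), \qquad \|v_\nu\|_{V(D)} \le C_{\rho,\delta}\,\rho^{-\nu}\prod_{i=1}^N \sqrt{2\nu_i + 1}, \qquad (3.6)$$

where $C_{\rho,\delta}$ is a constant involving $\prod_{i=1}^N \ell(\mathcal{E}_{\rho_i})$, with $\ell(\mathcal{E}_{\rho_i})$ denoting the
perimeter of the ellipse $\mathcal{E}_{\rho_i}$.

Consequently, the Legendre series $\sum_{\nu\in\mathcal{S}} u_\nu P_\nu$ and $\sum_{\nu\in\mathcal{S}} v_\nu L_\nu$ converge
towards $u$ in $L^\infty(\Gamma, V(D))$. The series $\sum_{\nu\in\mathcal{S}} v_\nu L_\nu$ also converges towards $u$
in $V(D)\otimes L^2_\varrho(\Gamma)$.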

[Proof was removed.] 

Under Assumptions 1-2, we remark that EUE and, if adding affine
dependence on parameters, DUE normally hold for infinitely many couples
$(\delta,\rho)$. We call the set of all $(\delta,\rho)$ such that EUE($\delta$, $\rho$)/DUE($\delta$, $\rho$) is fulfilled
the admissible set and denote it by $Ad$ for both cases. For a fixed $\nu \in \mathcal{S}$, the
best coefficient bounds given by Propositions 1 and 2 will be

$$\|t_\nu\|_{V(D)} \le \inf_{(\delta,\rho)\in Ad}\frac{\|f\|_{V^*(D)}}{\delta}\rho^{-\nu}, \qquad \|u_\nu\|_{V(D)} \le \inf_{(\delta,\rho)\in Ad} C_{\rho,\delta}\,\rho^{-\nu}\prod_{i=1}^N (2\nu_i + 1). \qquad (3.7)$$

Finding an efficient way to compute these infima, and an algorithm to construct
the corresponding quasi-optimal index sets, is an open question. In the specific
case where the basis functions $\psi_i$ have non-overlapping supports, however, the
vectors $\rho$ solving the minimization problems in (3.7) can be found easily. In
this case, the best a priori estimates retrieve the forms (3.2) and (3.6). Recent
studies have shown that although these theoretical bounds are not sharp, they
construct quite accurate polynomial spaces, see [6].



4 Asymptotic convergence analysis for a general class of 
multi-indexed series 

Consider a multi-indexed sequence of coefficient estimates written in the form
$(B(\nu))_{\nu\in\mathcal{S}} = (e^{-b(\nu)})_{\nu\in\mathcal{S}}$. In this section, we introduce a new, generalized
approach to estimating the asymptotic convergence of $\sum_{\nu\notin\Lambda_M} B(\nu)$ with
respect to $M$, under some general assumptions on $b$ which accommodate most
types of Taylor and Legendre coefficient bounds established in the current
literature. We recall that this truncation error represents the error of
quasi-optimal methods, as well as the accessible error of best M-term approximations.
It is enough to conduct the analysis with $\Lambda_M$ being the sets of all $\nu$ such that
$B(\nu) > e^{-J}$ for some $J \in \mathbb{N}$.

Our method can be summarized as follows. First, we split $(\Lambda_M)^c$ into a
family $(Q_j)_{j\in\mathbb{N},\, j\ge J}$ of disjoint subsets of $\mathcal{S}$ based on values of $B(\nu)$, where $Q_j$
contains $\nu$ satisfying $e^{-j-1} < B(\nu) \le e^{-j}$, so that the truncation error can
be bounded as

$$\sum_{\nu\notin\Lambda_M} B(\nu) = \sum_{j\ge J}\sum_{\nu\in Q_j} B(\nu) \le \sum_{j\ge J}\#(Q_j)\,e^{-j}. \qquad (4.1)$$

Obviously, finding a sharp approximation of $\#(Q_j)$ is central to estimate (4.1).
We define the superlevel sets $\mathcal{P}_j$ of $N$-dimensional real points

$$\mathcal{P}_j := \{\nu \in [0,\infty)^N : B(\nu) > e^{-j}\} = \{\nu \in [0,\infty)^N : b(\nu) < j\}, \qquad (4.2)$$

and, noticing that $\#(Q_j) = \#(\mathcal{P}_{j+1}\cap\mathbb{Z}^N) - \#(\mathcal{P}_j\cap\mathbb{Z}^N)$, seek to count
points with integer coordinates in $\mathcal{P}_j$. An appealing approach to solving this
problem is to study the interplay between $\#(\mathcal{P}_j\cap\mathbb{Z}^N)$ and the continuous
volume (Lebesgue measure) of $\mathcal{P}_j$. We first employ the following well-known
result in measure theory, reflecting the intuitive fact that for a geometric body
$\mathcal{P}$ in $\mathbb{R}^N$ the volume of $\mathcal{P}$, denoted by $|\mathcal{P}|$, can be approximated by the number
of shrunken integer points inside $\mathcal{P}$, see, e.g., Section 7.2 in [21] and Section
1.1 in [34].
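
The layer decomposition behind (4.1) can be tried out directly on a small case. The sketch below assumes a hypothetical linear exponent $b(\nu) = \sum_i \lambda_i\nu_i$ with positive weights, for which the layer $Q_j$ is simply $\{\nu : j \le b(\nu) < j+1\}$; it enumerates a finite range of layers and returns, for each, the count $\#(Q_j)$ and the exact layer sum, which must lie between $\#(Q_j)e^{-j-1}$ and $\#(Q_j)e^{-j}$.

```python
import itertools
import math


def layer_counts_and_sums(lam, j_min, j_max):
    """For b(nu) = sum_i lam[i] * nu[i], return {j: [count, total]} with
    count = #Q_j and total = sum of exp(-b(nu)) over Q_j, where
    Q_j = {nu in N^N : j <= b(nu) < j + 1} and j_min <= j < j_max."""
    # Any nu with b(nu) < j_max has nu_i < j_max / lam[i].
    ranges = [range(int(j_max / l) + 1) for l in lam]
    layers = {j: [0, 0.0] for j in range(j_min, j_max)}
    for nu in itertools.product(*ranges):
        b = sum(l * n for l, n in zip(lam, nu))
        j = math.floor(b)
        if j_min <= j < j_max:
            layers[j][0] += 1
            layers[j][1] += math.exp(-b)
    return layers


if __name__ == "__main__":
    # Hypothetical anisotropic weights.
    for j, (count, total) in layer_counts_and_sums((0.5, 0.7), 3, 8).items():
        print(j, count, total, count * math.exp(-j))
```

The last two printed columns show how tight the per-layer replacement of $B(\nu)$ by $e^{-j}$ is; the loss is at most a factor $e$ per layer.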

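Lemma 2 Suppose $\mathcal{P} \subset \mathbb{R}^N$ is a bounded Jordan measurable set. For $j \in \mathbb{N}$,
$j > 0$, there holds

$$|\mathcal{P}| = \lim_{j\to\infty}\frac{1}{j^N}\cdot\#\Big(\mathcal{P}\cap\frac{1}{j}\mathbb{Z}^N\Big) = \lim_{j\to\infty}\frac{1}{j^N}\cdot\#(j\mathcal{P}\cap\mathbb{Z}^N). \qquad (4.3)$$
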
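
A quick numerical illustration of this limit, under the assumption that the set is the unit simplex $\{x \in [0,\infty)^N : \sum_i x_i \le 1\}$ (chosen here only because its volume $1/N!$ is known in closed form): the sketch counts the integer points of the dilated set $j\mathcal{P}$ and divides by $j^N$. Membership is tested in integer arithmetic to avoid rounding on the boundary; the function names are introduced for this example.

```python
import itertools
import math


def lattice_count(in_dilated_set, j, dim):
    """Number of integer points nu in {0, ..., j}^dim with nu in j*P, where
    in_dilated_set(nu, j) decides membership of nu in j*P (P inside [0, 1]^dim)."""
    return sum(1 for nu in itertools.product(range(j + 1), repeat=dim)
               if in_dilated_set(nu, j))


def in_simplex(nu, j):
    # nu / j lies in the unit simplex iff sum(nu) <= j.
    return sum(nu) <= j


if __name__ == "__main__":
    dim = 2
    for j in (10, 50, 200):
        ratio = lattice_count(in_simplex, j, dim) / j ** dim
        print(j, ratio, 1.0 / math.factorial(dim))
```

The ratio approaches the volume from above at rate $O(1/j)$, which reflects the contribution of lattice points near the boundary.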

Concerning our goal of estimating (4.1), Lemma 2 has an interesting
consequence: if $b(\nu)$ is defined such that $\frac{1}{j}\mathcal{P}_j = \mathcal{P}$,
$\forall j \in \mathbb{N}$, with some $\mathcal{P} \subset \mathbb{R}^N$, one obtains
a simple asymptotic formula for $\#(\mathcal{P}_j \cap \mathbb{Z}^N)$:

$$\#(\mathcal{P}_j \cap \mathbb{Z}^N) \simeq j^N |\mathcal{P}|. \tag{4.4}$$

Such an approximation is powerful since, loosely speaking, it would allow
replacing $\#(Q_j)$ by $((j+1)^N - j^N)|\mathcal{P}|$ and reduce (4.1) to a much
easier, yet equivalent problem of estimating the truncation error via

$$\sum_{j \ge J} \big((j+1)^N - j^N\big)\, |\mathcal{P}|\, e^{-j}. \tag{4.5}$$

The property that the sets $\frac{1}{j}\mathcal{P}_j$ are unchanged over
$j \in \mathbb{N}$ is, however, restrictive, corresponding to only a few types of
coefficient upper bounds, for instance, $b(\nu)$ linear in $\nu$. For this
approach of estimation to be considered useful in the general quasi-optimal
approximation setting, this condition needs to be relaxed.
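
The asymptotic formula (4.4) can be checked numerically in the simplest case
where $b$ is linear. The following Python sketch is illustrative only: the
weights `lam = (1, 1, 2)` and the helper names are assumptions made for this
example and do not come from the analysis. It counts the integer points of
$\mathcal{P}_j = \{\nu : \sum_i \lambda_i \nu_i < j\}$ by brute force and compares
the count with $j^N |\mathcal{P}|$, where $|\mathcal{P}| = 1/(N! \prod_i \lambda_i)$
is the volume of the corresponding simplex.

```python
from itertools import product
from math import factorial, prod

def count_lattice_points(lam, j):
    """Count nu in N_0^N with sum(lam_i * nu_i) < j, i.e. #(P_j intersect Z^N).

    Assumes positive integer weights so that every comparison is exact.
    """
    # Inside P_j the coordinate nu_i never exceeds j / lam_i.
    ranges = [range(j // l + 1) for l in lam]
    return sum(
        1
        for nu in product(*ranges)
        if sum(l * n for l, n in zip(lam, nu)) < j
    )

def simplex_volume(lam):
    """Volume of {nu >= 0 : sum(lam_i * nu_i) < 1}, namely 1 / (N! * prod(lam))."""
    return 1.0 / (factorial(len(lam)) * prod(lam))

lam = (1, 1, 2)  # hypothetical anisotropic weights, N = 3
for j in (10, 40):
    count = count_lattice_points(lam, j)
    ratio = count / (j ** len(lam) * simplex_volume(lam))
    print(j, count, round(ratio, 4))
```

In this example the ratio of the count to $j^N|\mathcal{P}|$ stays above 1 and
moves towards 1 as $j$ grows, which is the behaviour (4.4) describes.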

For the technicality, we now extend definition (4.2) to equip the superlevel
sets with real indices: for $\tau \in (0,\infty)$, define

$$\mathcal{P}_\tau := \{\nu \in [0,\infty)^N : e^{-b(\nu)} > e^{-\tau}\} = \{\nu \in [0,\infty)^N : b(\nu) < \tau\}. \tag{4.6}$$

Note that the assertion of Lemma 2 still holds if replacing $j \in \mathbb{N}$ by
$\tau \in (0,\infty)$. We establish, in Lemma 3 below, formula (4.4) under some
weaker assumptions on $(\mathcal{P}_\tau)_{\tau \in \mathbb{R}^+}$:

i) $\mathcal{P}_\tau$ is Jordan measurable for countably infinite $\tau \in (0,\infty)$,

ii) The chain $(\frac{1}{\tau}\mathcal{P}_\tau)_{\tau \in \mathbb{R}^+}$ is either
ascending or descending towards a Jordan measurable limiting set
$\mathcal{P} \subset \mathbb{R}^N$ with $0 < |\mathcal{P}| < \infty$.

As we shall see later, these properties are satisfied by most existing polynomial
coefficient estimates.

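Lemma 3 Suppose $(\mathcal{P}_\tau)_{\tau \in \mathbb{R}^+}$ is a family of bounded
Lebesgue measurable sets in $\mathbb{R}^N$ satisfying either

$$\frac{1}{\tau_1}\mathcal{P}_{\tau_1} \subset \frac{1}{\tau_2}\mathcal{P}_{\tau_2}, \quad \forall \tau_1 \ge \tau_2 > 0, \tag{4.7}$$

or

$$\frac{1}{\tau_1}\mathcal{P}_{\tau_1} \supset \frac{1}{\tau_2}\mathcal{P}_{\tau_2}, \quad \forall \tau_1 \ge \tau_2 > 0. \tag{4.8}$$

Denote $\mathcal{P} = \bigcap_{\tau \in \mathbb{R}^+} \frac{1}{\tau}\mathcal{P}_\tau$ if (4.7)
holds and $\mathcal{P} = \bigcup_{\tau \in \mathbb{R}^+} \frac{1}{\tau}\mathcal{P}_\tau$
for the other case. If $\mathcal{P}$ is bounded Jordan measurable, $|\mathcal{P}| > 0$,
and there exists a sequence $(\tau_j)_{j \in \mathbb{N}}$ with $\tau_j \to \infty$ such
that $\mathcal{P}_{\tau_j}$ is Jordan measurable for all $j$, there follows

$$|\mathcal{P}| = \lim_{\tau \to \infty} \frac{1}{\tau^N}\, \#(\mathcal{P}_\tau \cap \mathbb{Z}^N). \tag{4.9}$$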

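Proof. We will give a proof with $(\mathcal{P}_\tau)_{\tau \in \mathbb{R}^+}$ satisfying
(4.7). The other case can be shown analogously. Let $\varepsilon$ be an arbitrary
positive number. By Lemma 2,

$$\frac{1}{\tau^N}\, \#(\tau\mathcal{P} \cap \mathbb{Z}^N) \to |\mathcal{P}| \quad \text{as } \tau \to \infty.$$

Since $\mathcal{P} \subset \frac{1}{\tau}\mathcal{P}_\tau$ $\forall \tau$, we can choose
$T_1 > 0$ such that $\forall \tau > T_1$,

$$\frac{1}{\tau^N}\, \#(\mathcal{P}_\tau \cap \mathbb{Z}^N) \ge \frac{1}{\tau^N}\, \#(\tau\mathcal{P} \cap \mathbb{Z}^N) > |\mathcal{P}| - \varepsilon. \tag{4.10}$$

On the other hand, $\mathcal{P} = \bigcap_{\tau \in \mathbb{R}^+} \frac{1}{\tau}\mathcal{P}_\tau$
yields $|\mathcal{P}| = \lim_{\tau \to \infty} |\frac{1}{\tau}\mathcal{P}_\tau|$. Let us
pick an $L > 0$ so that $\mathcal{P}_L$ is Jordan measurable and
$|\frac{1}{L}\mathcal{P}_L| < |\mathcal{P}| + \frac{\varepsilon}{2}$. By Lemma 2,

$$\frac{1}{\tau^N}\, \#\Big(\frac{\tau}{L}\mathcal{P}_L \cap \mathbb{Z}^N\Big) \to \Big|\frac{1}{L}\mathcal{P}_L\Big| \quad \text{as } \tau \to \infty.$$

There exists $T_2 > L$ satisfying $\forall \tau > T_2$,

$$\frac{1}{\tau^N}\, \#\Big(\frac{\tau}{L}\mathcal{P}_L \cap \mathbb{Z}^N\Big) < \Big|\frac{1}{L}\mathcal{P}_L\Big| + \frac{\varepsilon}{2} < |\mathcal{P}| + \varepsilon.$$

Since $\tau > L$, we have $\mathcal{P}_\tau \subset \frac{\tau}{L}\mathcal{P}_L$, which gives

$$\frac{1}{\tau^N}\, \#(\mathcal{P}_\tau \cap \mathbb{Z}^N) \le \frac{1}{\tau^N}\, \#\Big(\frac{\tau}{L}\mathcal{P}_L \cap \mathbb{Z}^N\Big) < |\mathcal{P}| + \varepsilon. \tag{4.11}$$

Combining (4.10) and (4.11) proves (4.9). □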

Lemma 3 provides us with an asymptotic formula of the form (4.4) to
approximate the number of integer points inside $\mathcal{P}_\tau$, under some
conditions on $(\mathcal{P}_\tau)_{\tau \in \mathbb{R}^+}$. Given a coefficient upper
bound $e^{-b(\nu)}$, it is desirable to derive properties of $b(\nu)$ such that its
corresponding superlevel sets $(\mathcal{P}_\tau)_{\tau \in \mathbb{R}^+}$ fulfill
these conditions. For all $\nu \in [0,\infty)^N$, define the map
$H_\nu : (0,\infty) \to \mathbb{R}$ as

$$H_\nu(\tau) = \frac{1}{\tau}\, b(\tau\nu), \quad \forall \tau \in (0,\infty).$$

We proceed to state and validate the following assumptions on $b(\nu)$.

Assumption 3 The map $b : [0,\infty)^N \to \mathbb{R}$ satisfies

1. $b(0) = 0$ and $b$ is continuous in $[0,\infty)^N$,

2. $H_\nu$ is either increasing in $(0,\infty)$ for all $\nu \in [0,\infty)^N$ or
decreasing in $(0,\infty)$ for all $\nu \in [0,\infty)^N$,

3. $b(\nu) \in \Theta(|\nu|)$. In other words, there exist $0 < c < C$ such that
$c|\nu| < b(\nu) < C|\nu|$ as $\nu \to \infty$.


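Lemma 4 Assume that $b : [0,\infty)^N \to \mathbb{R}$ satisfies Assumption 3. For
$\tau \in (0,\infty)$, denote $\mathcal{P}_\tau = \{\nu \in [0,\infty)^N : b(\nu) < \tau\}$. Let

$$\mathcal{P} = \begin{cases} \bigcap_{\tau \in \mathbb{R}^+} \big(\frac{1}{\tau}\mathcal{P}_\tau\big) & \text{if } H_\nu \text{ is increasing } \forall \nu \in [0,\infty)^N, \\ \bigcup_{\tau \in \mathbb{R}^+} \big(\frac{1}{\tau}\mathcal{P}_\tau\big) & \text{if } H_\nu \text{ is decreasing } \forall \nu \in [0,\infty)^N. \end{cases} \tag{4.12}$$

Then, $0 < |\mathcal{P}| < \infty$. If $\mathcal{P}$ is Jordan measurable, there holds

$$|\mathcal{P}| = \lim_{\tau \to \infty} \frac{1}{\tau^N}\, \#(\mathcal{P}_\tau \cap \mathbb{Z}^N). \tag{4.13}$$

Proof. From the continuity of $b$ in $[0,\infty)^N$ (Assumption 3.1), $\mathcal{P}_\tau$
is Jordan measurable for all except a countable number of values of $\tau$ (see [19]).

Next, from Assumption 3.2, if $H_\nu$ is increasing for all $\nu$, one has
$\frac{1}{\tau_2} b(\tau_2\nu) \le \frac{1}{\tau_1} b(\tau_1\nu) < 1$,
$\forall \tau_1 \ge \tau_2 > 0$, $\forall \nu \in \frac{1}{\tau_1}\mathcal{P}_{\tau_1}$, which implies

$$\frac{1}{\tau_1}\mathcal{P}_{\tau_1} \subset \frac{1}{\tau_2}\mathcal{P}_{\tau_2}, \quad \forall \tau_1 \ge \tau_2 > 0.$$

Since $b(\nu)$ converges towards $+\infty$ as $\nu \to \infty$, $\mathcal{P}_\tau$ is
bounded for every $\tau \in (0,\infty)$. It is trivial that
$\mathcal{P} = \bigcap_{\tau \in \mathbb{R}^+} \frac{1}{\tau}\mathcal{P}_\tau$ is bounded.
Let $\nu \notin \mathcal{P}$; we have $\nu \notin \frac{1}{\tau}\mathcal{P}_\tau$ for
$\tau$ large enough. Combining with Assumption 3.3 yields
$C\tau|\nu| > b(\tau\nu) \ge \tau$. Thus, $\mathcal{B}(0, 1/C) \subset \mathcal{P}$ and
$|\mathcal{P}| > 0$.

If, on the other hand, $H_\nu$ is decreasing for all $\nu$, then
$\frac{1}{\tau_1} b(\tau_1\nu) \le \frac{1}{\tau_2} b(\tau_2\nu) < 1$,
$\forall \tau_1 \ge \tau_2 > 0$, $\forall \nu \in \frac{1}{\tau_2}\mathcal{P}_{\tau_2}$, which gives

$$\frac{1}{\tau_1}\mathcal{P}_{\tau_1} \supset \frac{1}{\tau_2}\mathcal{P}_{\tau_2}, \quad \forall \tau_1 \ge \tau_2 > 0.$$

Since $\mathcal{P} = \bigcup_{\tau \in \mathbb{R}^+} \frac{1}{\tau}\mathcal{P}_\tau$, it is
trivial that $|\mathcal{P}| > 0$. Furthermore, for any $\nu \in \mathcal{P}$,
$b(\tau\nu) < \tau$ with $\tau$ large enough. Combining with Assumption 3.3 that
$b(\tau\nu) > c\tau|\nu|$, this implies $|\nu| < 1/c$. Thus,
$\mathcal{P} \subset \mathcal{B}(0, 1/c)$ and $|\mathcal{P}| < \infty$.

If $\mathcal{P}$ is Jordan measurable, since the family
$(\mathcal{P}_\tau)_{\tau \in \mathbb{R}^+}$ has been proved to satisfy the conditions
of Lemma 3, we can apply this to get (4.13). □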

As seen in the proof, the continuity of $b$ (Assumption 3.1) assures that the
superlevel sets $\mathcal{P}_\tau$ are “well-behaved” (Jordan measurable). Meanwhile,
the monotonicity of $H_\nu$ (Assumption 3.2) leads to the ascending (or descending)
property of the chain $(\frac{1}{\tau}\mathcal{P}_\tau)$. To guarantee the limiting set
$\mathcal{P}$ is bounded and not null, we assume $c|\nu| < b(\nu) < C|\nu|$ for some
$0 < c < C$ (Assumption 3.3), so that
$\mathcal{B}(0, 1/C) \subset \mathcal{P} \subset \mathcal{B}(0, 1/c)$. It should be
noted that $c$ and $C$ are generic constants, which are only utilized to represent
the boundedness of $\mathcal{P}$ and do not affect our convergence rate; thus a
specification of $c$ and $C$ is not necessary. In the subsequent analysis, we apply
(4.13) to derive an error estimate, only depending on $\mathcal{P}$ and the
parameter dimension $N$, of the form $M \exp(-(M/|\mathcal{P}|)^{1/N})$. This rate is
consistent with the proven sub-exponential convergence $M \exp(-(\kappa M)^{1/N})$
for some simple coefficient upper bounds [6]. Nevertheless, our analysis completely
exploits detailed information on the size and shape of the index sets in the
asymptotic regime via the introduction of $\mathcal{P}$ and, as a result, acquires
the optimal value of $\kappa$.

It is worth remarking that Lemma 3 requires $\mathcal{P}$ to be Jordan measurable.
Indeed, we show here a simple counterexample in which $\mathcal{P}$ is not Jordan
measurable and (4.13) fails to hold. Consider the integer-indexed collection of
Jordan measurable sets $(\mathcal{P}_j)_{j \in \mathbb{N}}$ defined by

$$\mathcal{P}_j = j\,\big([0,1] \setminus \{p/q : p, q \in \mathbb{Z},\ 0 \le p \le q \le j\}\big).$$

Observe that $(\frac{1}{j}\mathcal{P}_j)_{j \in \mathbb{N}}$ is descending towards
$\mathcal{P} = [0,1] \setminus \mathbb{Q}$, which is not Jordan measurable. We have
$\#(\mathcal{P}_j \cap \mathbb{Z}) = 0$ $\forall j$ while $|\mathcal{P}| = 1$,
contradicting (4.13). The condition on Jordan measurability of $\mathcal{P}$ is,
however, not restrictive in the context of quasi-optimal methods, since the
limiting sets usually do not have very complicated (e.g., fractal) shapes. Indeed,
all examples investigated herein show the convexity of $\mathcal{P}$, which
trivially implies its Jordan measurability, as required.

The mathematical evidence that Assumption 3 is satisfied by published
Taylor and Legendre coefficient estimates will be presented in Section 5. Four
examples of upper bounds $e^{-b(\nu)}$ will be considered, including $\rho^{-\nu}$
(as in (3.2)), $\inf_{(\delta,\rho) \in Ad} \frac{1}{\delta}\rho^{-\nu}$ (as in (3.7)),
$\rho^{-\nu} \prod_{i=1}^N \sqrt{2\nu_i + 1}$ (as in (3.6)), and
$\frac{|\nu|!}{\nu!}\alpha^\nu$ (as in [7, 13]). For now, with Lemma 3 giving an
approximation for $\#(\mathcal{P}_j \cap \mathbb{Z}^N)$, it remains to study the
estimation problem (4.5). We proceed to prove the following supporting result.


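Lemma 5 For any $N, J, L \in \mathbb{N}$, if
$J > \max\Big\{\frac{1}{e^{1/N} - 1}, \frac{L}{e^{(L-1)/N} - 1}\Big\}$, it gives

$$\sum_{j \ge J} j^N e^{-j} \le \frac{L e}{e - 1}\, J^N e^{-J}. \tag{4.14}$$

Particularly,

$$\sum_{j \ge J} j^N e^{-j} \le \frac{2e}{e - 1}\, J^N e^{-J}, \quad \forall J > \frac{2}{e^{1/N} - 1}, \tag{4.15}$$

$$\sum_{j \ge J} j^N e^{-j} \le (N + 1)\, \frac{e}{e - 1}\, J^N e^{-J}, \quad \forall J > \frac{1}{e^{1/N} - 1},\ N \ge 4. \tag{4.16}$$

Proof. We have

$$\sum_{j \ge J} j^N e^{-j} = J^N \sum_{k \ge 0} \sum_{\ell = 0}^{L-1} \Big(1 + \frac{Lk + \ell}{J}\Big)^N e^{-J - Lk - \ell}. \tag{4.17}$$

We prove that for every $k \ge 0$, $0 \le \ell \le L - 1$,

$$\Big(1 + \frac{Lk + \ell}{J}\Big)^N e^{-J - Lk - \ell} \le e^{-J - k}. \tag{4.18}$$

Consider $\ell = 0$. If $k = 0$, (4.18) holds trivially. If $k > 0$, it is equivalent to

$$\Big(1 + \frac{Lk}{J}\Big)^N \le e^{(L-1)k}, \quad \text{or} \quad J \ge \frac{Lk}{e^{(L-1)k/N} - 1},$$

which is true since $J > \frac{L}{e^{(L-1)/N} - 1}$ and the right-hand side is
decreasing in $k$.

Now, for $\ell > 0$, observe that

$$\Big(1 + \frac{Lk + \ell}{J}\Big)^N = \Big(1 + \frac{Lk}{J}\Big)^N \Big(1 + \frac{\ell}{Lk + J}\Big)^N \le \Big(1 + \frac{Lk}{J}\Big)^N \Big(1 + \frac{1}{J}\Big)^{N\ell} \le e^{(L-1)k}\, e^{\ell},$$

since $J > \frac{1}{e^{1/N} - 1}$, and (4.18) follows.

Combining (4.17) and (4.18) gives

$$\sum_{j \ge J} j^N e^{-j} \le L\, J^N \sum_{j \ge J} e^{-j},$$

which yields (4.14).

(4.15) can be obtained from (4.14) with $L = 2$. For (4.16), applying (4.14)
with $L = N + 1$, we only need to verify
$\frac{1}{e^{1/N} - 1} > \frac{N + 1}{e - 1}$. We have

$$\frac{e - 1}{e^{1/N} - 1} = \sum_{i = 0}^{N-1} e^{i/N} > N + 1,$$

since $e^{(N-1)/N} > 1.5$ and $e^{(N-2)/N} > 1.5$ for $N \ge 4$, and $e^{i/N} \ge 1$
for all $0 \le i \le N - 3$, proving (4.16). □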

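It is easy to see that $\sum_{j \ge J} j^N e^{-j}$ is also bounded from below by

$$\sum_{j \ge J} j^N e^{-j} \ge J^N \sum_{j \ge J} e^{-j} = \frac{e}{e - 1}\, J^N e^{-J}, \quad \forall J \in \mathbb{N},$$

verifying the sharpness of estimate (4.14). This sub-exponential convergence
rate, however, is effective with $J > \frac{1}{e^{1/N} - 1} \simeq N$. Since
$J^N e^{-J}$ is increasing with respect to $J$ for $J < N$, this seems not an
appropriate rate to describe the decay of $\sum_{j \ge J} j^N e^{-j}$ in the
pre-asymptotic regime.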
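
The two-sided control of $\sum_{j \ge J} j^N e^{-j}$ by $J^N e^{-J}$ is easy to
probe numerically. The sketch below is only an illustration under our own
assumptions (the function names, the truncation of the series after 500 terms,
and the tested values of $N$ are ours): it evaluates the ratio
$\sum_{j \ge J} j^N e^{-j} / (J^N e^{-J})$ at the smallest integer $J$ admitted by
(4.15) and prints it next to the constants $e/(e-1)$ and $2e/(e-1)$.

```python
import math

def normalized_tail(N, J, terms=500):
    """Return sum_{j >= J} j^N e^{-j} divided by J^N e^{-J}.

    The sum is written as sum_k (1 + k/J)^N e^{-k} to avoid overflow and is
    truncated after `terms` terms, which only drops a negligible remainder.
    """
    return sum((1.0 + k / J) ** N * math.exp(-k) for k in range(terms))

def threshold(N, L=2):
    """Lower limit on J required by (4.14) for a given L >= 2."""
    return max(1.0 / math.expm1(1.0 / N), L / math.expm1((L - 1.0) / N))

lower = math.e / (math.e - 1.0)        # constant of the lower bound
upper = 2.0 * math.e / (math.e - 1.0)  # constant of (4.15), i.e. L = 2

for N in (2, 4, 8):
    J = math.floor(threshold(N)) + 1   # smallest admissible integer J
    print(N, J, round(lower, 4), round(normalized_tail(N, J), 4), round(upper, 4))
```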

We are now ready to analyze the asymptotic truncation error of the general
multi-indexed series $\sum_{\nu \in S} e^{-b(\nu)}$ relevant to quasi-optimal and
best $M$-term Taylor and Legendre approximations. The main result of this section
is stated and proved below.

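Theorem 2 Consider the multi-indexed series $\sum_{\nu \in S} e^{-b(\nu)}$ with
$b : [0,\infty)^N \to \mathbb{R}$ satisfying Assumption 3. For $\tau \in (0,\infty)$,
denote $\mathcal{P}_\tau = \{\nu \in [0,\infty)^N : b(\nu) < \tau\}$ and $\Lambda_M$
the set of indices corresponding to the $M$ largest $e^{-b(\nu)}$. Define
$\mathcal{P} = \bigcap_{\tau \in \mathbb{R}^+} (\frac{1}{\tau}\mathcal{P}_\tau)$ or
$\mathcal{P} = \bigcup_{\tau \in \mathbb{R}^+} (\frac{1}{\tau}\mathcal{P}_\tau)$ as in
(4.12). If $\mathcal{P}$ is Jordan measurable, for any $\varepsilon > 0$, there
exists $M_\varepsilon > 0$ depending on $\varepsilon$ such that

$$\sum_{\nu \notin \Lambda_M} e^{-b(\nu)} \le C_u(\varepsilon)\, M \exp\bigg(-\Big(\frac{M}{|\mathcal{P}|(1 + \varepsilon)}\Big)^{1/N}\bigg) \tag{4.19}$$

for all $M > M_\varepsilon$. Here, $C_u(\varepsilon) = (4e + 4\varepsilon e - 2)\frac{e}{e - 1}$.

Proof. We apply Lemma 4 to get

$$|\mathcal{P}| = \lim_{\tau \to \infty} \frac{1}{\tau^N}\, \#(\mathcal{P}_\tau \cap \mathbb{Z}^N).$$

For a fixed $\varepsilon > 0$, there exists $\Delta_\varepsilon > 0$ such that for all
integer $j > \Delta_\varepsilon$,

$$-\frac{1}{2}|\mathcal{P}| < \frac{1}{j^N}\, \#(\mathcal{P}_j \cap \mathbb{Z}^N) - |\mathcal{P}| < \varepsilon |\mathcal{P}|, \tag{4.20}$$

i.e., $\frac{1}{2} j^N |\mathcal{P}| < \#(\mathcal{P}_j \cap \mathbb{Z}^N) < j^N |\mathcal{P}|(1 + \varepsilon)$.

To analyze the asymptotic convergence of $\sum_{\nu \notin \Lambda_M} e^{-b(\nu)}$,
it is sufficient to consider this sum with
$\Lambda_M = \mathcal{P}_J \cap \mathbb{Z}^N$, $J \in \mathbb{N}$. First, observe that
for all integer $J > \frac{1}{e^{1/N} - 1}$ and $J > \Delta_\varepsilon$, from (4.20),

$$\begin{aligned} \sum_{\nu \notin \mathcal{P}_J \cap \mathbb{Z}^N} e^{-b(\nu)} &\le \sum_{j \ge J} \big(\#(\mathcal{P}_{j+1} \cap \mathbb{Z}^N) - \#(\mathcal{P}_j \cap \mathbb{Z}^N)\big)\, e^{-j} \\ &\le |\mathcal{P}| \sum_{j \ge J} \big[(j+1)^N - j^N\big] e^{-j} + |\mathcal{P}| \sum_{j \ge J} \Big[\varepsilon (j+1)^N + \frac{1}{2} j^N\Big] e^{-j} \\ &\le (e - 1)|\mathcal{P}| \sum_{j \ge J} j^N e^{-j} + |\mathcal{P}| \sum_{j \ge J} \Big(\varepsilon e + \frac{1}{2}\Big) j^N e^{-j}, \end{aligned}$$

the last estimate coming from $(j+1)^N < e\, j^N$.

Applying Lemma 5 with $L = 2$ and $J > \frac{2}{e^{1/N} - 1}$, we have

$$\sum_{\nu \notin \mathcal{P}_J \cap \mathbb{Z}^N} e^{-b(\nu)} \le (2e + 2\varepsilon e - 1)\, |\mathcal{P}|\, J^N e^{-J}\, \frac{e}{e - 1}. \tag{4.21}$$

Now, we need to write (4.21) in terms of
$M = \#(\Lambda_M) = \#(\mathcal{P}_J \cap \mathbb{Z}^N)$. From (4.20), it is easy to
see that

$$\frac{1}{2} J^N |\mathcal{P}| < M < J^N |\mathcal{P}|(1 + \varepsilon). \tag{4.22}$$

Combining (4.21)-(4.22) gives

$$\sum_{\nu \notin \Lambda_M} e^{-b(\nu)} \le C_u(\varepsilon)\, M \exp\bigg(-\Big(\frac{M}{|\mathcal{P}|(1 + \varepsilon)}\Big)^{1/N}\bigg),$$

where $C_u(\varepsilon) = (4e + 4\varepsilon e - 2)\frac{e}{e - 1}$, as desired. □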
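
To see estimate (4.19) at work, the following Python sketch evaluates both sides
in a case where everything is explicit. It is a minimal illustration under our
own assumptions: we take $N = 2$ and $b(\nu) = \nu_1 + \nu_2$, for which
$|\mathcal{P}| = 1/2$ and $\#(\mathcal{P}_j \cap \mathbb{Z}^2) = j(j+1)/2$, so that
(4.20) with $\varepsilon = 0.5$ holds for every $j > 2$ and the requirement
$J > 2/(e^{1/2} - 1) \approx 3.08$ gives $J \ge 4$. The function names and the
cut-off `extra` are hypothetical choices for this example.

```python
import math
from itertools import product

def c_u(eps):
    """Constant C_u(eps) = (4e + 4*eps*e - 2) * e / (e - 1) from Theorem 2."""
    e = math.e
    return (4 * e + 4 * eps * e - 2) * e / (e - 1)

def cardinality_and_tail(J, N=2, extra=60):
    """For b(nu) = nu_1 + ... + nu_N return (M, tail), where
    M = #{nu : |nu| < J} and tail sums e^{-|nu|} over J <= |nu| < J + extra.
    Indices with |nu| >= J + extra are dropped; their contribution is tiny.
    """
    M, tail = 0, 0.0
    for nu in product(range(J + extra), repeat=N):
        s = sum(nu)
        if s < J:
            M += 1
        elif s < J + extra:
            tail += math.exp(-s)
    return M, tail

def upper_bound(M, volume, eps, N=2):
    """Right-hand side of (4.19)."""
    return c_u(eps) * M * math.exp(-((M / (volume * (1 + eps))) ** (1.0 / N)))

eps, volume = 0.5, 0.5   # |P| = 1/N! = 1/2 for the isotropic simplex, N = 2
for J in (4, 8, 16):
    M, tail = cardinality_and_tail(J)
    print(J, M, tail, upper_bound(M, volume, eps))
```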


Remark 1 (Theoretical minimum cardinality $M_\varepsilon$) The error estimate
(4.19) holds with

$$M > M_\varepsilon := \#(\mathcal{P}_{J_\varepsilon} \cap \mathbb{Z}^N), \quad \text{where } J_\varepsilon = \max\Big\{\frac{2}{e^{1/N} - 1}, \Delta_\varepsilon\Big\}. \tag{4.23}$$

It is shown in (4.20) that $\Delta_\varepsilon$ is decreasing with respect to
$\varepsilon$. Thus, a stronger convergence rate, corresponding to smaller
$\varepsilon$, would be realized at larger cardinality $M$. An evaluation of
$\Delta_\varepsilon$ is not accessible to us in general, making explicit
computation (or mathematical formula) of the minimum cardinality $M_\varepsilon$
not feasible. However, in the settings where $\mathcal{P}$ is a rational convex
polytope, $\Delta_\varepsilon$ can be acquired computationally. The interplay
between $\varepsilon$ and $M_\varepsilon$ will be investigated through several
examples within such settings in Section 6.

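In any case, (4.19) requires $J > \frac{2}{e^{1/N} - 1}$. This condition can be
relaxed with a slightly weaker estimate. Indeed, applying (4.16) instead of (4.15)
in the proof of Theorem 2, one gets

$$\sum_{\nu \notin \Lambda_M} e^{-b(\nu)} \le C_u'(\varepsilon)\, M \exp\bigg(-\Big(\frac{M}{|\mathcal{P}|(1 + \varepsilon)}\Big)^{1/N}\bigg), \tag{4.24}$$

given

$$M > M_\varepsilon' := \#(\mathcal{P}_{J_\varepsilon'} \cap \mathbb{Z}^N), \quad \text{where } J_\varepsilon' = \max\Big\{\frac{1}{e^{1/N} - 1}, \Delta_\varepsilon\Big\}. \tag{4.25}$$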


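Remark 2 (An extension of Theorem 2 for $b(\nu) \in \Theta(|\nu|^\beta)$) The
convergence estimate (4.19) does not apply for $|\mathcal{P}| = 0$ or $\mathcal{P}$
unbounded ($b(\nu) \notin \Theta(|\nu|)$). With minor modifications in the above
analysis, our results can be extended to a wider class of $b(\nu)$ where
Assumption 3.3 ($b(\nu) \in \Theta(|\nu|)$) is replaced by the condition that
$b(\nu) \in \Theta(|\nu|^\beta)$ (i.e., there exist constants $0 < c < C$ such that
$c|\nu|^\beta < b(\nu) < C|\nu|^\beta$ as $\nu \to \infty$) with some fixed
$\beta > 0$. In such cases, it gives

$$\sum_{\nu \notin \Lambda_M} e^{-b(\nu)} \le C_u(\varepsilon)\, M \exp\bigg(-\Big(\frac{M}{|\mathcal{P}|(1 + \varepsilon)}\Big)^{\beta/N}\bigg)$$

as $M \to \infty$. Here, $\mathcal{P} = \bigcap_{\tau \in \mathbb{R}^+} (\tau^{-1/\beta}\mathcal{P}_\tau)$
or $\mathcal{P} = \bigcup_{\tau \in \mathbb{R}^+} (\tau^{-1/\beta}\mathcal{P}_\tau)$
(depending on whether $(\tau^{-1/\beta}\mathcal{P}_\tau)$ is descending or ascending).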


5 Asymptotic convergence rates of quasi-optimal and best
M-term approximations

As we have seen so far, the error of a quasi-optimal polynomial approximation
can be estimated by the series of corresponding coefficient upper bounds.
This also represents an accessible convergence estimate for the best $M$-term
approximation, as discussed in Section 1. We will verify in this section that
for most upper bounds developed in recent publications, such series fall into
the class of multi-indexed series analyzed in Section 4. Particularly, in all
considered cases, the coefficient estimates, written as $e^{-b(\nu)}$, will be
proved to satisfy Assumption 3 and can be treated by Theorem 2.










Given a vector $\rho = (\rho_i)_{1 \le i \le N}$ with $\rho_i > 1$ $\forall i$, we
define $\lambda = (\lambda_i)_{1 \le i \le N}$ such that

$$\lambda_i = \log \rho_i > 0 \quad \forall\, 1 \le i \le N.$$

In Sections 5.1 and 5.2, we study the error analysis of quasi-optimal and best
$M$-term Taylor and Legendre approximations. (However, for ease of presentation,
in what follows, we mostly refer to the analysis as error estimates of
quasi-optimal methods.) A computational comparison of our proposed estimate with
existing results is shown in Section 5.3.


5.1 Error analysis of quasi-optimal Taylor approximations 

We start with the quasi-optimal methods corresponding to a basic coefficient
bound of the form $\rho^{-\nu}$ (see Proposition 1). These are reasonable schemes
for Taylor approximations of elliptic problems with the random fields composed
of non-overlapping basis functions. The convergence result is stated in the
following proposition.

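Proposition 3 Consider the Taylor series $\sum_{\nu \in S} t_\nu y^\nu$ of $u$.
Assume that

$$\|t_\nu\|_{V(D)} \le \frac{\|f\|_{V^*(D)}}{\delta}\, \rho^{-\nu} \tag{5.1}$$

holds for all $\nu \in S$, as in Proposition 1. Denote by $\Lambda_M$ the set of
indices corresponding to the $M$ largest bounds in (5.1). For any $\varepsilon > 0$,
there exists $M_\varepsilon > 0$ depending on $\varepsilon$ such that

$$\sup_{y \in \Gamma} \Big\|u(y) - \sum_{\nu \in \Lambda_M} t_\nu y^\nu\Big\|_{V(D)} \le \frac{\|f\|_{V^*(D)}}{\delta}\, C_u(\varepsilon)\, M \exp\bigg(-\Big(\frac{M N! \prod_{i=1}^N \lambda_i}{1 + \varepsilon}\Big)^{1/N}\bigg) \tag{5.2}$$

for all $M > M_\varepsilon$.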

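Proof. We have by the triangle inequality

$$\sup_{y \in \Gamma} \Big\|u(y) - \sum_{\nu \in \Lambda_M} t_\nu y^\nu\Big\|_{V(D)} \le \sum_{\nu \notin \Lambda_M} \|t_\nu\|_{V(D)} \le \frac{\|f\|_{V^*(D)}}{\delta} \sum_{\nu \notin \Lambda_M} \rho^{-\nu}. \tag{5.3}$$

For $\nu \in [0,\infty)^N$, define $b(\nu) = \sum_{i=1}^N \lambda_i \nu_i$, so that
$\rho^{-\nu} = e^{-b(\nu)}$ $\forall \nu \in S$. We notice that the quasi-optimal
index sets in this case are the Total Degree spaces:

$$\mathcal{P}_j \cap \mathbb{Z}^N = \{\nu \in S : \rho^{-\nu} > e^{-j}\} = \Big\{\nu \in S : \sum_{i=1}^N \lambda_i \nu_i < j\Big\}, \quad \forall j \in \mathbb{N}.$$

Since $\lambda_i > 0$ $\forall i$, it is easy to check that the map $b$ satisfies
Assumption 3 with $H_\nu$ being constant $\forall \nu$. Observing that
$\mathcal{P} = \bigcap_{\tau \in \mathbb{R}^+} (\frac{1}{\tau}\mathcal{P}_\tau) = \{\nu \in [0,\infty)^N : \sum_{i=1}^N \lambda_i \nu_i < 1\}$,
we can specify $|\mathcal{P}| = \frac{1}{N! \prod_{i=1}^N \lambda_i}$.

We are now ready to apply Theorem 2 to obtain

$$\sum_{\nu \notin \Lambda_M} \rho^{-\nu} \le C_u(\varepsilon)\, M \exp\bigg(-\Big(\frac{M}{|\mathcal{P}|(1 + \varepsilon)}\Big)^{1/N}\bigg) = C_u(\varepsilon)\, M \exp\bigg(-\Big(\frac{M N! \prod_{i=1}^N \lambda_i}{1 + \varepsilon}\Big)^{1/N}\bigg),$$

which proves (5.2). □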

We proceed to analyze the quasi-optimal Taylor approximations based on the
best analytical bound provided by Proposition 1. Although this method is
not easily implementable, an asymptotic error estimate can be obtained as a
simple corollary of Theorem 2. It is reasonable to assume that the set $Ad$ of
all admissible $(\delta, \rho)$ is bounded: as seen through several examples in
Figure 1, the domains of uniform ellipticity do not expand infinitely in the
complex plane.


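Proposition 4 Consider the Taylor series $\sum_{\nu \in S} t_\nu y^\nu$ of $u$. Assume

$$\|t_\nu\|_{V(D)} \le \inf_{(\delta,\rho) \in Ad} \frac{\|f\|_{V^*(D)}}{\delta}\, \rho^{-\nu} \tag{5.4}$$

holds for all $\nu \in S$, as in (3.7), with $Ad$ being bounded. Denote by
$\Lambda_M$ the set of indices corresponding to the $M$ largest bounds in (5.4).
For any $\varepsilon > 0$, there exists $M_\varepsilon > 0$ depending on
$\varepsilon$ such that

$$\sup_{y \in \Gamma} \Big\|u(y) - \sum_{\nu \in \Lambda_M} t_\nu y^\nu\Big\|_{V(D)} \le \|f\|_{V^*(D)}\, C_u(\varepsilon)\, M \exp\bigg(-\Big(\frac{M}{|\mathcal{P}|(1 + \varepsilon)}\Big)^{1/N}\bigg) \tag{5.5}$$

for all $M > M_\varepsilon$. Here,
$\mathcal{P} = \{\nu \in [0,\infty)^N : \sum_{i=1}^N (\log \rho_i)\, \nu_i < 1\ \ \forall (\delta, \rho) \in Ad\}$.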

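Proof. First, we have

$$\sup_{y \in \Gamma} \Big\|u(y) - \sum_{\nu \in \Lambda_M} t_\nu y^\nu\Big\|_{V(D)} \le \sum_{\nu \notin \Lambda_M} \|t_\nu\|_{V(D)} \le \|f\|_{V^*(D)} \sum_{\nu \notin \Lambda_M} \inf_{(\delta,\rho) \in Ad} \frac{1}{\delta}\, \rho^{-\nu}. \tag{5.6}$$

Recall that $\lambda_i = \log \rho_i$ $\forall i$. With abuse of notation, we say
$(\delta, \lambda) \in Ad$ iff $(\delta, \rho) \in Ad$. For $\nu \in [0,\infty)^N$,
define $b(\nu) = \sup_{(\delta,\lambda) \in Ad} \big(\log \delta + \sum_{i=1}^N \lambda_i \nu_i\big)$,
so that $\inf_{(\delta,\rho) \in Ad} \frac{1}{\delta}\rho^{-\nu} = e^{-b(\nu)}$
$\forall \nu \in S$. The quasi-optimal index sets in this case are:

$$\mathcal{P}_j \cap \mathbb{Z}^N = \Big\{\nu \in S : \sup_{(\delta,\lambda) \in Ad} \Big(\log \delta + \sum_{i=1}^N \lambda_i \nu_i\Big) < j\Big\}, \quad \forall j \in \mathbb{N}.$$

We will show that $b$ fulfills Assumption 3. It is easy to check that $b$ is
convex. As a consequence, for any $\tau > 0$, $\mathcal{P}_\tau$ and
$\bigcap_{\tau \in \mathbb{R}^+} (\frac{1}{\tau}\mathcal{P}_\tau)$ are convex (and
Jordan measurable). Since $Ad$ is bounded, there exist $0 < c < C$ such that
$c|\nu| < b(\nu) < C|\nu|$ as $\nu \to \infty$. Now, let $\tau > \tau' > 0$; it gives

$$\frac{1}{\tau}\Big(\log \delta + \sum_{i=1}^N \lambda_i \tau \nu_i\Big) \ge \frac{1}{\tau'}\Big(\log \delta + \sum_{i=1}^N \lambda_i \tau' \nu_i\Big), \quad \forall \nu \in [0,\infty)^N,$$

since $\delta < 1$. Hence, $H_\nu(\tau) \ge H_\nu(\tau')$, $\forall \nu \in [0,\infty)^N$.

We can apply Theorem 2 to get the asymptotic estimate

$$\sum_{\nu \notin \Lambda_M} \inf_{(\delta,\rho) \in Ad} \frac{1}{\delta}\, \rho^{-\nu} \le C_u(\varepsilon)\, M \exp\bigg(-\Big(\frac{M}{|\mathcal{P}|(1 + \varepsilon)}\Big)^{1/N}\bigg), \tag{5.7}$$

where $\mathcal{P} := \bigcap_{\tau \in \mathbb{R}^+} (\frac{1}{\tau}\mathcal{P}_\tau) = \{\nu \in [0,\infty)^N : \sum_{i=1}^N \lambda_i \nu_i < 1\ \ \forall (\delta, \lambda) \in Ad\}$.

Combining (5.6) and (5.7) gives (5.5), concluding the proof. □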


5.2 Error analysis of quasi-optimal Legendre approximations 

For the first example, we consider quasi-optimal methods for Legendre
approximations of elliptic PDEs with the random field consisting of basis
functions with disjoint supports. In [6], these problems were computationally
treated with bounds of type (5.1) and Total Degree index sets with some success.
However, those bounds are not analytically optimal, as the true exponential
decay of coefficients is penalized by a large multiplier. In the following, we
establish a convergence analysis for the sharper upper bound
$\rho^{-\nu} \prod_{i=1}^N \sqrt{2\nu_i + 1}$ of Legendre coefficients (see
Section 3.3). Whether the quasi-optimal method corresponding to this estimate
outperforms Total Degree approximations in computation is an interesting subject
to study next.

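Proposition 5 Consider the Legendre series $\sum_{\nu \in S} v_\nu L_\nu$ of $u$.
Assume that

$$\|v_\nu\|_{V(D)} \le C_{\rho,\delta}\, \rho^{-\nu} \prod_{i=1}^N \sqrt{2\nu_i + 1} \tag{5.8}$$

holds for all $\nu \in S$, as in Proposition 2. Denote by $\Lambda_M$ the set of
indices corresponding to the $M$ largest bounds in (5.8). For any
$\varepsilon > 0$, there exists a constant $M_\varepsilon > 0$ depending on
$\varepsilon$ such that

$$\Big\|u - \sum_{\nu \in \Lambda_M} v_\nu L_\nu\Big\|^2_{V(D) \otimes L^2_\varrho(\Gamma)} \le C_{\rho,\delta}^2\, C_u(\varepsilon)\, M \exp\bigg(-2\Big(\frac{M N! \prod_{i=1}^N \lambda_i}{1 + \varepsilon}\Big)^{1/N}\bigg) \tag{5.9}$$

for all $M > M_\varepsilon$.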

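Proof. First, we have

$$\Big\|u - \sum_{\nu \in \Lambda_M} v_\nu L_\nu\Big\|^2_{V(D) \otimes L^2_\varrho(\Gamma)} \le C_{\rho,\delta}^2 \sum_{\nu \notin \Lambda_M} \rho^{-2\nu} \prod_{i=1}^N (2\nu_i + 1).$$

For $\nu \in [0,\infty)^N$, define
$b(\nu) = \sum_{i=1}^N \big(2\lambda_i \nu_i - \log(2\nu_i + 1)\big)$, so that
$\rho^{-2\nu} \prod_{i=1}^N (2\nu_i + 1) = e^{-b(\nu)}$ $\forall \nu \in S$. We notice
that the quasi-optimal index sets in this case are given by:

$$\mathcal{P}_j \cap \mathbb{Z}^N = \Big\{\nu \in S : \sum_{i=1}^N \big(2\lambda_i \nu_i - \log(2\nu_i + 1)\big) < j\Big\}, \quad \forall j \in \mathbb{N}.$$

We proceed to prove $b$ satisfies Assumption 3. It is easy to check that $b(\nu)$
is continuous. As $\nu \to \infty$,
$\lambda_{\min}|\nu| < b(\nu) < 2\lambda_{\max}|\nu|$, where
$\lambda_{\min} = \min_{1 \le i \le N} \lambda_i$ and
$\lambda_{\max} = \max_{1 \le i \le N} \lambda_i$. Also, observing
$\log(\alpha t + 1) \ge t \log(\alpha + 1)$ for every $\alpha > 0$, $0 \le t \le 1$,
we have

$$H_\nu(\tau') = \sum_{i=1}^N \Big(2\lambda_i \nu_i - \frac{1}{\tau'} \log(2\tau'\nu_i + 1)\Big) \le \sum_{i=1}^N \Big(2\lambda_i \nu_i - \frac{1}{\tau} \log(2\tau\nu_i + 1)\Big) = H_\nu(\tau)$$

for all $\tau, \tau' \in (0,\infty)$, $\tau > \tau'$.

It is easy to see that
$\mathcal{P} := \bigcap_{\tau \in \mathbb{R}^+} (\frac{1}{\tau}\mathcal{P}_\tau) = \{\nu \in [0,\infty)^N : \sum_{i=1}^N 2\lambda_i \nu_i \le 1\}$.
Thus, $\mathcal{P}$ is Jordan measurable and
$|\mathcal{P}| = \frac{1}{2^N N! \prod_{i=1}^N \lambda_i}$. Applying Theorem 2, we obtain

$$\sum_{\nu \notin \Lambda_M} \rho^{-2\nu} \prod_{i=1}^N (2\nu_i + 1) \le C_u(\varepsilon)\, M \exp\bigg(-2\Big(\frac{M N! \prod_{i=1}^N \lambda_i}{1 + \varepsilon}\Big)^{1/N}\bigg)$$

for all $M > M_\varepsilon$. This concludes our proof. □

We remark that while the bound (5.8) is weaker than (5.1), its corresponding
index sets are descending towards Total Degree sets. As a result, we are
able to obtain the same convergence rate as Taylor approximations.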
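
The monotonicity of $H_\nu$ used in the proof of Proposition 5, and the fact
that the logarithmic correction disappears in the limit, can be observed
numerically. The following sketch is a hedged illustration: the values of
`lam` and `nu` are hypothetical and were chosen so that
$2\sum_i \lambda_i \nu_i = 1$, i.e., the point lies on the boundary of the
limiting Total Degree set.

```python
import math

def b_legendre(nu, lam):
    """b(nu) = sum_i (2*lam_i*nu_i - log(2*nu_i + 1)), as in the proof above."""
    return sum(2 * l * n - math.log(2 * n + 1) for l, n in zip(lam, nu))

def H(nu, lam, tau):
    """H_nu(tau) = b(tau * nu) / tau."""
    return b_legendre([tau * n for n in nu], lam) / tau

lam = (1.0, 2.0)   # hypothetical values of lambda_i = log(rho_i)
nu = (0.3, 0.1)    # a real point with 2 * sum(lam_i * nu_i) = 1
taus = (0.5, 1, 2, 5, 10, 100, 1e6)
values = [H(nu, lam, t) for t in taus]
print(values)      # increasing in tau and approaching 1 from below
```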

Now, we apply our framework to prove a convergence estimate for quasi-optimal
Legendre approximations based on the coefficient exponential decay
$\|v_\nu\|_{V(D)} \le \|f\|_{V^*(D)} \frac{|\nu|!}{\nu!}\alpha^\nu$. Unlike other
upper bounds discussed so far, this decay is established by real analysis
argument [13]. In the case of affine
linear random fields, i.e. a{x,y) = ao(x) + Piipiix), a = (ai)i<i<Af is 

specified by at = A development and implementation of quasi- 

optimal method can be found in [7]; however, no error estimate has been
provided. In the following result, similar to the aforementioned works, we assume
$\sum_{i=1}^N \alpha_i < 1$, which is necessary for the summability of the sequence
$\big(\frac{|\nu|!}{\nu!}\alpha^\nu\big)_{\nu \in S}$.


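Proposition 6 Consider the Legendre series $\sum_{\nu \in S} v_\nu L_\nu$ of $u$.
Assume there exists a vector $\alpha = (\alpha_i)_{1 \le i \le N}$ with
$\alpha_i > 0$ $\forall i$ and $\sum_{i=1}^N \alpha_i < 1$ such that

$$\|v_\nu\|_{V(D)} \le \|f\|_{V^*(D)}\, \frac{|\nu|!}{\nu!}\, \alpha^\nu \tag{5.10}$$

for all $\nu \in S$. Denote by $\Lambda_M$ the set of indices corresponding to the
$M$ largest bounds in (5.10). For any $\varepsilon > 0$, there exists a constant
$M_\varepsilon > 0$ depending on $\varepsilon$ such that

$$\Big\|u - \sum_{\nu \in \Lambda_M} v_\nu L_\nu\Big\|^2_{V(D) \otimes L^2_\varrho(\Gamma)} \le \|f\|^2_{V^*(D)}\, C_u(\varepsilon)\, M \exp\bigg(-\Big(\frac{M}{|\mathcal{P}|(1 + \varepsilon)}\Big)^{1/N}\bigg) \tag{5.11}$$

for all $M > M_\varepsilon$. Here, with $\lambda_i = -\log \alpha_i$,

$$\mathcal{P} = \Big\{\nu \in (0,\infty)^N : \sum_{i=1}^N \lambda_i \nu_i - \log \frac{|\nu|^{|\nu|}}{\prod_{i=1}^N \nu_i^{\nu_i}} < \frac{1}{2}\Big\}.$$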
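Proof. From (5.10), we have

$$\Big\|u - \sum_{\nu \in \Lambda_M} v_\nu L_\nu\Big\|^2_{V(D) \otimes L^2_\varrho(\Gamma)} \le \|f\|^2_{V^*(D)} \sum_{\nu \notin \Lambda_M} \Big(\frac{|\nu|!}{\nu!}\alpha^\nu\Big)^2.$$

Let $\lambda_i = -\log \alpha_i > 0$ $\forall\, 1 \le i \le N$ and $\Gamma$ denote the
gamma function. Also, let $\psi_0$, $\psi_1$ and $\psi_2$ be the di-, tri- and
tetra-gamma functions respectively: $\psi_0 = (\log \Gamma)'$,
$\psi_1 = \psi_0' = (\log \Gamma)''$, $\psi_2 = \psi_1' = (\log \Gamma)'''$. For
$\nu \in [0,\infty)^N$, define

$$b(\nu) = 2\sum_{i=1}^N \lambda_i \nu_i - 2\log \frac{\Gamma(|\nu| + 1)}{\prod_{i=1}^N \Gamma(\nu_i + 1)},$$

so that $\big(\frac{|\nu|!}{\nu!}\alpha^\nu\big)^2 = e^{-b(\nu)}$ $\forall \nu \in S$.
The quasi-optimal index sets in this case are given by:

$$\mathcal{P}_j \cap \mathbb{Z}^N = \Big\{\nu \in S : \sum_{i=1}^N \lambda_i \nu_i - \log \frac{\Gamma(|\nu| + 1)}{\prod_{i=1}^N \Gamma(\nu_i + 1)} < \frac{j}{2}\Big\}, \quad \forall j \in \mathbb{N}.$$

We proceed to prove $b$ satisfies Assumption 3. First, since
$\sum_{i=1}^N \alpha_i < 1$, one can find $p \in (0,1)$ such that
$\sum_{i=1}^N \alpha_i^p < 1$; by Theorem 7.2 in [13], we have
$\big(\frac{|\nu|!}{\nu!}\alpha^{p\nu}\big)_{\nu \in S}$ $\ell^1$-summable. This gives
$\frac{|\nu|!}{\nu!}\alpha^{p\nu} < 1$ as $\nu \to \infty$ and there follows

$$\Big(\frac{|\nu|!}{\nu!}\Big)^2 < \alpha^{-2p\nu}, \quad \text{i.e.,} \quad b(\nu) > (2 - 2p)\sum_{i=1}^N \lambda_i \nu_i \quad \text{as } \nu \to \infty.$$

Next, define
$g(\tau) = \frac{1}{\tau} \log \frac{\Gamma(|\tau\nu| + 1)}{\prod_{i=1}^N \Gamma(\tau\nu_i + 1)}$
to be a mapping from $(0,\infty)$ to $\mathbb{R}$. We will prove $H_\nu$ is
decreasing by showing $g$ is an increasing function. Observing that

$$\frac{\Gamma(|\tau\nu| + 1)}{\prod_{i=1}^N \Gamma(\tau\nu_i + 1)} = \prod_{q=2}^N \frac{\Gamma\big(\tau\sum_{i=1}^q \nu_i + 1\big)}{\Gamma\big(\tau\sum_{i=1}^{q-1} \nu_i + 1\big)\, \Gamma(\tau\nu_q + 1)},$$

without loss of generality, we can assume $N = 2$. Consider the first derivative
of $g$:

$$\begin{aligned} g'(\tau) = &-\frac{1}{\tau^2} \log \frac{\Gamma(\tau\nu_1 + \tau\nu_2 + 1)}{\Gamma(\tau\nu_1 + 1)\Gamma(\tau\nu_2 + 1)} + \frac{1}{\tau}(\nu_1 + \nu_2) \frac{\Gamma'(\tau\nu_1 + \tau\nu_2 + 1)}{\Gamma(\tau\nu_1 + \tau\nu_2 + 1)} \\ &- \frac{1}{\tau}\nu_1 \frac{\Gamma'(\tau\nu_1 + 1)}{\Gamma(\tau\nu_1 + 1)} - \frac{1}{\tau}\nu_2 \frac{\Gamma'(\tau\nu_2 + 1)}{\Gamma(\tau\nu_2 + 1)}. \end{aligned}$$

Then $g'(\tau) > 0$ $\forall \tau > 0$ iff
$h(\nu_1 + \nu_2) > h(\nu_1) + h(\nu_2)$, $\forall \nu_1, \nu_2 > 0$, where
$h(s) := s\psi_0(s + 1) - \log(\Gamma(s + 1))$.

We have $h''(s) = s\psi_2(s + 1) + \psi_1(s + 1) > 0$ for any $s > 0$, see Theorem
1, [17], so $h$ is convex. Combining with the fact that $h(0) = 0$, this implies
the superadditivity of $h$ in $[0,\infty)$, as desired. Note that for
$\nu \in (0,\infty)^N$, $g$ is strictly increasing in $(0,\infty)$.

Since $H_\nu$ is decreasing, define the limiting set
$\mathcal{P} = \bigcup_{\tau \in \mathbb{R}^+} (\frac{1}{\tau}\mathcal{P}_\tau)$. We will
characterize $\mathcal{P}$ and show it is Jordan measurable. Without loss of
generality, we can ignore the set of points of $\mathcal{P}$ in the coordinate
hyperplanes, since it is of measure zero. Using the strictly increasing property
of $g$ for $\nu \in (0,\infty)^N$, it gives

$$\bigcup_{\tau \in \mathbb{R}^+} \Big(\frac{1}{\tau}\mathcal{P}_\tau\Big) = \Big\{\nu \in (0,\infty)^N : \sum_{i=1}^N \lambda_i \nu_i - \lim_{\tau \to \infty} \frac{1}{\tau} \log \frac{\Gamma(|\tau\nu| + 1)}{\prod_{i=1}^N \Gamma(\tau\nu_i + 1)} < \frac{1}{2}\Big\}.$$

Applying Stirling's formula, see, e.g., [1], yields

$$\begin{aligned} \lim_{\tau \to \infty} \frac{1}{\tau} \log \frac{\Gamma(|\tau\nu| + 1)}{\prod_{i=1}^N \Gamma(\tau\nu_i + 1)} &= \log \lim_{\tau \to \infty} \bigg(\frac{|\tau\nu|^{|\tau\nu| + \frac{1}{2}}\, e^{-|\tau\nu|}\, (2\pi)^{\frac{1}{2}}}{\prod_{i=1}^N (\tau\nu_i)^{\tau\nu_i + \frac{1}{2}}\, e^{-\tau\nu_i}\, (2\pi)^{\frac{1}{2}}}\bigg)^{1/\tau} \\ &= \log \lim_{\tau \to \infty} \bigg(\frac{\tau^{|\tau\nu| + \frac{1}{2}}\, |\nu|^{|\tau\nu| + \frac{1}{2}}}{\prod_{i=1}^N \tau^{\tau\nu_i + \frac{1}{2}}\, \nu_i^{\tau\nu_i + \frac{1}{2}}}\, (2\pi)^{\frac{1 - N}{2}}\bigg)^{1/\tau} = \log \frac{|\nu|^{|\nu|}}{\prod_{i=1}^N \nu_i^{\nu_i}}, \end{aligned}$$

and we obtain

$$\mathcal{P} = \Big\{\nu \in (0,\infty)^N : \sum_{i=1}^N \lambda_i \nu_i - \log \frac{|\nu|^{|\nu|}}{\prod_{i=1}^N \nu_i^{\nu_i}} < \frac{1}{2}\Big\}.$$

For the Jordan measurability of $\mathcal{P}$, we prove $\mathcal{P}$ is convex. It
is enough to show the function
$G(\nu) := \log \frac{|\nu|^{|\nu|}}{\prod_{i=1}^N \nu_i^{\nu_i}}$ is concave in
$(0,\infty)^N$. Denote by $\nabla^2 G$ the Hessian matrix of $G$ and again assume
$N = 2$; we have

$$\nabla^2 G = \begin{pmatrix} \frac{1}{\nu_1 + \nu_2} - \frac{1}{\nu_1} & \frac{1}{\nu_1 + \nu_2} \\ \frac{1}{\nu_1 + \nu_2} & \frac{1}{\nu_1 + \nu_2} - \frac{1}{\nu_2} \end{pmatrix}.$$

Let $x = (x_1, x_2)^\top \in \mathbb{R}^2 \setminus \{0\}$; it gives

$$x^\top (\nabla^2 G)\, x = \frac{(x_1 + x_2)^2}{\nu_1 + \nu_2} - \frac{x_1^2}{\nu_1} - \frac{x_2^2}{\nu_2} \le 0,$$

by employing the Cauchy-Schwarz inequality. Thus, $\nabla^2 G$ is negative
semidefinite, which implies the concavity of $G$.

We can apply Theorem 2 to get the asymptotic estimate

$$\sum_{\nu \notin \Lambda_M} \Big(\frac{|\nu|!}{\nu!}\alpha^\nu\Big)^2 \le C_u(\varepsilon)\, M \exp\bigg(-\Big(\frac{M}{|\mathcal{P}|(1 + \varepsilon)}\Big)^{1/N}\bigg).$$

The proof is now complete. □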
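
The limit obtained from Stirling's formula in the proof of Proposition 6, and
the monotonicity of $g$, can be checked with the log-gamma function from the
Python standard library. The sketch below is illustrative only; the sample
point `nu = (0.3, 0.7)` and the function names are our own assumptions.

```python
import math

def g(nu, tau):
    """g(tau) = (1/tau) * log( Gamma(|tau*nu| + 1) / prod_i Gamma(tau*nu_i + 1) )."""
    total = sum(nu)
    log_ratio = math.lgamma(tau * total + 1) - sum(
        math.lgamma(tau * n + 1) for n in nu
    )
    return log_ratio / tau

def g_limit(nu):
    """Limit of g as tau -> infinity: log( |nu|^|nu| / prod_i nu_i^nu_i )."""
    total = sum(nu)
    return total * math.log(total) - sum(n * math.log(n) for n in nu)

nu = (0.3, 0.7)  # hypothetical point in (0, infinity)^2
taus = (1, 10, 100, 1e4, 1e6)
values = [g(nu, t) for t in taus]
print(values, g_limit(nu))  # g increases with tau towards g_limit(nu)
```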


5.3 A computational comparison of our proposed estimate with previously 
established rates of convergence 


Most of the established explicit error estimates for best $M$-term and
quasi-optimal approximations concern the coefficient bounds of the form


\\tAv(D) < P 


(5.12) 


and therefore are derived via the truncation error of
$\sum_{\nu \in S} \rho^{-\nu}$. We compare our approach with others in current
literature in estimating this quantity. Recall, we proved in Proposition 3 that

$$\sum_{\nu \notin \Lambda_M} \rho^{-\nu} \le C_u(\varepsilon)\, M \exp\bigg(-\Big(\frac{M N! \prod_{i=1}^N \lambda_i}{1 + \varepsilon}\Big)^{1/N}\bigg). \tag{5.13}$$

Application of the Stechkin inequality gives

$$\sum_{\nu \notin \Lambda_M} \rho^{-\nu} \le \Big(\sum_{\nu \in S} \rho^{-p\nu}\Big)^{1/p} M^{1 - \frac{1}{p}} \tag{stech}$$

for every $0 < p < 1$. We note that (stech) holds for every $M$ and is not
asymptotic.

Development due to [6] computes $p \in (0,1)$ minimizing (stech) for each $M$
and obtains


. l/N 

K j Ni 

where $\xi$ is the rate adjusting parameter varying from 0 to $(e - 1)/e$. Large
$\xi$ gives stronger convergence but also requires a more restrictive minimum
cardinality. The best convergence is only guaranteed in the limit $M \to \infty$.

Figure 2 shows a comparison of our error estimate with (stech) and (optim)
in computing the series $\sum_{\nu \in S} e^{-(\nu_1 + \nu_2 + 2\nu_3 + 4\nu_4)}$.
(optim) is plotted at its best possible rate with $\xi = (e - 1)/e$. We also plot
the exact value of $\sum_{\nu \notin \Lambda_M} e^{-(\nu_1 + \nu_2 + 2\nu_3 + 4\nu_4)}$,
which can be calculated using the Ehrhart polynomial$^1$ in this case, for
reference. We observe that while (stech) holds for any rate $M^{1 - 1/p}$, the
attached coefficient is very large with small $p$ and strong rates are not
effective except at high cardinality; (optim) is slightly above (stech), and both
of them show considerable discrepancy with the exact truncation error, verifying
that the Stechkin inequality is not sharp. Estimate (5.13), on the other hand, is
close to the true value, even with $\varepsilon$ large. Besides, the actual
minimum cardinality for the estimate
to hold is shown as ~ 1 for e = 4.0, ~ 10 for e = 1.0 and Mg ~ 10^ 

for $\varepsilon = 0.3$. Also notice that since $(N!)^{1/N} \simeq N/e$, (optim) and
(5.13) are similar, except for the rate adjusting parameters. While
$1/(1 + \varepsilon)^{1/N}$ in (5.13) can be close to 1, $\xi$ is bounded by
$(e - 1)/e \simeq 0.63$, resulting in the best convergence

attainable by (optim) approximately M exp 



(optim) 


N 


S p-SMexpl-jlMH 



Fig. 2: A comparison of our error estimate in computing the series
$\sum_{\nu \in S} e^{-(\nu_1 + \nu_2 + 2\nu_3 + 4\nu_4)}$ with those resulting from
some previous approaches.

$^1$ Defined in Section 6.

We consider next the problem of finding tight upper bounds of

$$\text{error} := \sup_{y \in \Gamma} \Big\|u(y) - \sum_{\nu \in \Lambda_M} t_\nu y^\nu\Big\|_{V(D)},$$

assuming $u(z)$ is a holomorphic function in an open neighborhood of the
polydisc $\mathcal{O}_\rho$ with $\rho_1 = \ldots = \rho_N > 1$. We note that (5.13)
holds here, since the exponential decay (5.12) occurs (see Section 3.3), with
$\lambda_i = \log \rho_i =: \lambda$, $\forall i$. An isotropic estimate introduced
in [6], when applied to this error, gives


error < 


(1-e ^/2) ^exp log(l - e) , 


(optim-b) 


where e = ^1 —j ■ This bound is obtained based on an optimization 

of a Stechkin-type estimation, also presented in [6], 

error < (1 - - e-P^/2)-^/p, (stech-b) 


for p > 0. Another nice result due to [4,6], employing complex analysis tech¬ 
nique, proves 


error < —r -e '', 

— 1 

, which implies 

error < exp A(MA^!)^/^^ (complex) 

in asymptotic regime. 

Figure 3 plots estimate (5.13) and the upper bounds listed above in the case
$\lambda = 1$ and $N = 8$. The exact truncation error in computing the series
$\sum_{\nu \in S} \exp(-\sum_{i=1}^8 \nu_i)$ is also shown. It is interesting to see
that the (optim-b) curve is almost tangent to the (stech-b) lines, elucidating
that (optim-b) is obtained by an optimization of (stech-b). Again, estimate (5.13)
exhibits a much better approximation of the exact truncation error than (stech-b)
and (optim-b). It should, however, be noted that (optim-b) is proved to hold with
relatively small cardinalities ($M > 1.09^N$), which are not covered by our
analysis. The best convergence rate here is given by (complex). The advantage of
(complex) lies in the fact that, unlike other approaches, it seeks to approximate
the remainder of the Taylor series
$\|u(y) - \sum_{\nu \in \Lambda_M} t_\nu y^\nu\|_{V(D)}$ directly without using the
triangle inequality. Figure 3 shows a discrepancy between (complex) and the exact
calculation of $\sum_{\nu \notin \Lambda_M} \exp(-\sum_{i=1}^8 \nu_i)$, revealing
that the triangle inequality is not sharp in all cases.

We are, unfortunately, not aware of an extension of (complex) outside the 
isotropic setting. 


for M = 


N + J 
J 



Fig. 3: A comparison of our error estimate with those resulting from some
previous approaches in an isotropic setting.


6 The optimality of our proposed estimation and pre-asymptotic 
error analysis: a simplified case 

In this section, we consider the particular case in which

i) $\mathcal{P}$ is a rational convex polytope,

ii) $\mathcal{P}_\tau = \tau\mathcal{P}$ for all $\tau \in (0,\infty)$.

This setting, arising from the multi-indexed sequence
$(e^{-b(\nu)})_{\nu \in S}$ with

$$b(\nu) = \sup_{\lambda \in A} \Big(\sum_{i=1}^N \lambda_i \nu_i\Big), \quad \text{where } A \text{ is a finite subset of } (\mathbb{Q}^+)^N, \tag{6.1}$$

is appropriate for Taylor coefficient estimates of the form (3.2) and, to some
extent, (3.7) (as discussed in Section 5). The advantage here is that the
number of integer points $\#(\mathcal{P}_j \cap \mathbb{Z}^N)$ can be represented by
a computable Ehrhart quasi-polynomial of degree $N$ in $j$ (see [8], Chapter 3 and
[33], Chapter 4). In other words, there exist a period $q$ and polynomials
$E_0, \ldots, E_{q-1}$ of degree $N$ with leading coefficient $|\mathcal{P}|$ such
that $\#(\mathcal{P}_j \cap \mathbb{Z}^N) = E_i(j)$ if $j \equiv i \bmod q$. We
exploit this property for two tasks: first, to establish a lower bound of
$\sum_{\nu \notin \Lambda_M} e^{-b(\nu)}$ and verify the sharpness of estimate
(4.19); second, to calculate the minimum cardinalities $M_\varepsilon$ and
$M_\varepsilon'$ for (4.19) and (4.24) to hold (via the computations of the Ehrhart
quasi-polynomials) and study the relation between them and the convergence rate.
To circumvent the constraints on $M_\varepsilon$ and $M_\varepsilon'$, an estimate
of the truncation errors in the pre-asymptotic regime will be derived.
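
The quasi-polynomial structure of the lattice point count can be seen on a
small example. The sketch below is an illustration under our own assumptions:
we take $N = 2$ and $b(\nu) = \nu_1 + 2\nu_2$ (a hypothetical choice of weights),
for which a direct computation gives period $q = 2$ and the closed forms
$(j+1)^2/4$ for odd $j$ and $j(j+2)/4$ for even $j$, both with leading
coefficient $|\mathcal{P}| = 1/4$. The code compares these formulas with a
brute-force count and checks that third differences with step $q$ vanish.

```python
from itertools import product

def count_lattice_points(lam, j):
    """#(P_j intersect Z^N) for b(nu) = sum(lam_i * nu_i), positive integer lam_i."""
    ranges = [range(j // l + 1) for l in lam]
    return sum(
        1
        for nu in product(*ranges)
        if sum(l * n for l, n in zip(lam, nu)) < j
    )

def ehrhart_closed_form(j):
    """Closed form for lam = (1, 2): period q = 2, leading coefficient 1/4."""
    return (j + 1) ** 2 // 4 if j % 2 == 1 else j * (j + 2) // 4

def third_difference(counts, j, q):
    """Third finite difference with step q; zero for a degree-2 polynomial."""
    return counts[j + 3 * q] - 3 * counts[j + 2 * q] + 3 * counts[j + q] - counts[j]

lam = (1, 2)
counts = {j: count_lattice_points(lam, j) for j in range(1, 31)}
print([counts[j] for j in range(1, 9)])
print([third_difference(counts, j, 2) for j in range(1, 9)])
```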
6.1 Lower bound of the truncation errors

We begin this section with an additional assumption on $b$, which is fulfilled by
$b(\nu)$ defined in (6.1).

Assumption 4 (Monotonically increasing) $b : [0,\infty)^N \to \mathbb{R}$ satisfies:
$\forall \nu, \mu \in [0,\infty)^N$, if $\nu \le \mu$, then $b(\nu) \le b(\mu)$.

Given this monotone property, the number of integer points inside a superlevel
set $\mathcal{P}_\tau$ is never smaller than its Lebesgue measure. This observation
is verified in the following lemma.

Lemma 6 Assume that $b : [0,\infty)^N \to \mathbb{R}$ is continuous and satisfies
Assumption 4. For $\tau \in (0,\infty)$, denote
$\mathcal{P}_\tau = \{\nu \in [0,\infty)^N : b(\nu) < \tau\}$. We have

$$\#(\mathcal{P}_\tau \cap \mathbb{Z}^N) \ge |\mathcal{P}_\tau|, \quad \forall \tau > 0.$$

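Proof. We consider a partition of $[0,\infty)^N$ by the family of cells
$(I_\nu)_{\nu \in S}$ defined as

$$I_\nu = \prod_{1 \le i \le N} [\nu_i, \nu_i + 1).$$

Denote $S^* = \{\nu \in S : \mathcal{P}_\tau \cap I_\nu \ne \emptyset\}$. If
$\nu \in S^*$, by definition, there exists $\mu \in I_\nu$ such that
$b(\mu) < \tau$. Since $\nu \le \mu$ and $b$ satisfies Assumption 4, it gives
$b(\nu) \le b(\mu) < \tau$. We have $\nu \in \mathcal{P}_\tau \cap \mathbb{Z}^N$, which
implies $S^* \subset \mathcal{P}_\tau \cap \mathbb{Z}^N$ and
$\#(S^*) \le \#(\mathcal{P}_\tau \cap \mathbb{Z}^N)$.

On the other hand, there holds

$$|\mathcal{P}_\tau| = \sum_{\nu \in S^*} |\mathcal{P}_\tau \cap I_\nu| \le \sum_{\nu \in S^*} |I_\nu| = \#(S^*).$$

We obtain $|\mathcal{P}_\tau| \le \#(S^*) \le \#(\mathcal{P}_\tau \cap \mathbb{Z}^N)$, as
desired. □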
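
Lemma 6 can be illustrated on a small polytope of the form (6.1). In the
sketch below, the choice $b(\nu) = \max(\nu_1 + 2\nu_2,\ 2\nu_1 + \nu_2)$ is a
hypothetical example of ours; for it, $\mathcal{P}_\tau$ is the quadrilateral with
vertices $(0,0)$, $(\tau/2, 0)$, $(\tau/3, \tau/3)$, $(0, \tau/2)$, whose area is
$\tau^2/6$. The code compares a brute-force lattice point count with this area
for several values of $\tau$, including non-integer ones.

```python
import math
from itertools import product

def b(nu):
    """Hypothetical b of the form (6.1) with A = {(1, 2), (2, 1)}."""
    n1, n2 = nu
    return max(n1 + 2 * n2, 2 * n1 + n2)

def lattice_count(tau):
    """Number of integer points nu >= 0 with b(nu) < tau."""
    bound = math.ceil(tau)  # b(nu) >= nu_i, so every coordinate is below tau
    return sum(1 for nu in product(range(bound + 1), repeat=2) if b(nu) < tau)

def area(tau):
    """Lebesgue measure of P_tau for this b: tau^2 / 6."""
    return tau ** 2 / 6.0

for tau in (0.5, 1, 2.5, 7, 10.3, 25):
    print(tau, lattice_count(tau), round(area(tau), 3))
```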

Now, we proceed to establish a lower bound for the truncation errors of the
series $\sum_{\nu \in S} e^{-b(\nu)}$ with $b(\nu)$ having the form (6.1).

Theorem 3 Consider the multi-indexed series $\sum_{\nu \in S} e^{-b(\nu)}$ with
$b(\nu)$ given by (6.1). There exists a constant $M^* > 0$ such that


e > Ce « exp 

v^Am 



( 6 . 2 ) 


for all M > M*. Here, V is defined as in (4.12), <^^ = 5(1)^ ” ^ 

where q is the period of Ehrhart quasi-polynomial ofV. 
Proof. It is easy to see that $b$ satisfies Assumption 3. Particularly,
$b(\tau\nu) = \tau b(\nu)$, $H_\nu$ is constant and
$\frac{1}{\tau}\mathcal{P}_\tau = \mathcal{P} = \{\nu \in [0,\infty)^N : b(\nu) < 1\}$
for all $\tau \in (0,\infty)$, $\nu \in [0,\infty)^N$. By definition of $b$,
$\mathcal{P}$ is a rational convex polytope. We can find $q \in \mathbb{N}$ and an
$N$-th order polynomial $E$ with leading coefficient $|\mathcal{P}|$ such that

$$\#(\mathcal{P}_{jq} \cap \mathbb{Z}^N) = E(jq), \quad \forall j \in \mathbb{N}. \tag{6.3}$$

For Am = Vjq n it gives 

^ nz^) - #(p,, nz^))e-(^+i)« 

v^Am 3^J /g 

= X! + 9 ) - E{jq)) 

3>J 

Denoting E{t) = \V\t^ + CiP, Vt G K, we have 

N-1 

E{jq + q)- E{jq) > q\V\N{jq)^-'^ 

i=0 

There exists Ti > 0 satisfying 

iV-l 

h\iijq + qy ^ < ^iVlNijq)^ \ Vj G N, j > Ti. (6.6) 

i^O 

Combining (6.4)-(6.6) yields for J > Ti, 

i'^Am j>J 

> ^ = ^Nq{qJ)^-yr\^^. 

j>J ® 

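We need to write this estimate in terms of the cardinality $M$. First, noticing
that $b$ satisfies Assumption 4, there holds

$$|\mathcal{P}|(Jq)^N = |\mathcal{P}_{Jq}| \le \#(\mathcal{P}_{Jq} \cap \mathbb{Z}^N). \tag{6.8}$$

Applying Theorem 2, it gives
$|\mathcal{P}| = \lim_{j \to \infty} \frac{1}{(jq)^N}\, \#(\mathcal{P}_{jq} \cap \mathbb{Z}^N)$.
We can choose $T_2 > 0$ such that for all $j \in \mathbb{N}$, $j > T_2$,

$$-\frac{1}{2}|\mathcal{P}| < |\mathcal{P}| - \frac{1}{(jq)^N}\, \#(\mathcal{P}_{jq} \cap \mathbb{Z}^N). \tag{6.9}$$

Since $M = \#(\mathcal{P}_{Jq} \cap \mathbb{Z}^N)$, from (6.8) and (6.9), one has

$$|\mathcal{P}|(Jq)^N \le M < \frac{3}{2}|\mathcal{P}|(Jq)^N \quad \text{for } J > T_2. \tag{6.10}$$

Combining (6.7) and (6.10) gives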

where = ‘‘ ■ The proof is now complete. □ 

Theorem 2 and Theorem 3 reveal that for $b(\nu)$ given by (6.1), the asymptotic
truncation error of $\sum_{\nu} e^{-b(\nu)}$ can be bounded from below and above
as



where $C_l$ and $C_u(\varepsilon)$ are mild constants in comparison with the total bounds.
The optimality of our estimation is verified in these cases. 

6.2 Asymptotic minimum cardinalities and their relation with the 
convergence rate 

In this section, we will apply Ehrhart (quasi-)polynomial to investigate the 
minimum cardinality for our asymptotic convergence rate to hold. Recall that 
for any e > 0, the upper estimates (4.19) and (4.24) occur with J > = 

max I 1 I ^cid J > J'^ = max | gi/N_i i 4\e | , respectively. The first 

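constraints in both conditions are straightforward and we focus on quantifying
$\Delta_\varepsilon$. From (4.20), $\Delta_\varepsilon$ is the positive real number such that

$$j^N |\mathcal{P}| \le \#(\mathcal{P}_j \cap \mathbb{Z}^N) \le j^N |\mathcal{P}| (1 + \varepsilon), \quad \forall j \in \mathbb{N},\ j \ge \Delta_\varepsilon. \qquad (6.11)$$

In case $\mathcal{P}$ is a rational convex polytope and $\mathcal{P}_\tau = \tau \mathcal{P}$ for all $\tau \in (0, \infty)$, we can
ignore the left inequality of (6.11), which by Lemma 6 is true for all $j \in \mathbb{N}$.
There exists a (quasi-)polynomial

$$E^*(j) = |\mathcal{P}| j^N + \sum_{i=0}^{N-1} c_i^*(j)\, j^i \qquad (6.12)$$

with $c_i^* : \mathbb{N} \to \mathbb{Q}$ being a periodic function with integer period $q$ such that

$$\#(\mathcal{P}_j \cap \mathbb{Z}^N) = E^*(j), \quad \forall j \in \mathbb{N}, \qquad (6.13)$$

see [8], Chapter 3 and [33], Chapter 4. Substituting (6.13) into (6.11), $\Delta_\varepsilon$ can be
characterized as the largest among the solutions of

$$\varepsilon |\mathcal{P}| j^N - \sum_{i=0}^{N-1} c_i^*(j)\, j^i = 0.$$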
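
The characterization above can be illustrated on the simplest case. The following is a minimal sketch, assuming the isotropic simplex $b(\nu) = \sum_i \nu_i$, for which the Ehrhart polynomial is the binomial coefficient $\binom{N+j}{N}$ and $|\mathcal{P}| = 1/N!$. The function names are hypothetical, and the returned value is only an integer stand-in for $\Delta_\varepsilon$ (the smallest integer $j$ satisfying the right inequality of (6.11)), not the real root itself. Because the ratio $\#(\mathcal{P}_j \cap \mathbb{Z}^N) / (j^N |\mathcal{P}|) = \prod_{i=1}^{N} (1 + i/j)$ decreases in $j$ for this simplex, the first $j$ that satisfies the inequality is also a valid threshold for all larger $j$.

```python
from fractions import Fraction
from math import comb, factorial


def simplex_lattice_count(N: int, j: int) -> int:
    """#{nu in N_0^N : nu_1 + ... + nu_N <= j}, the Ehrhart polynomial of the simplex."""
    return comb(N + j, N)


def integer_threshold(N: int, eps, j_max: int = 10**6) -> int:
    """Smallest integer j with count(j) <= (1 + eps) * j^N * |P|, where |P| = 1/N!.

    Exact rational arithmetic avoids rounding issues near equality.
    """
    eps = Fraction(eps)
    volume = Fraction(1, factorial(N))
    for j in range(1, j_max + 1):
        if simplex_lattice_count(N, j) <= (1 + eps) * volume * j**N:
            return j
    raise ValueError("no threshold found up to j_max")


if __name__ == "__main__":
    for N in (4, 8):
        for eps in (Fraction(1, 2), 1, 2):
            j0 = integer_threshold(N, eps)
            # cardinality of the index set at the threshold level
            print(N, float(eps), j0, simplex_lattice_count(N, j0))
```

For anisotropic or truncated polytopes such as those considered in this section, the counts are not given by a single binomial and a tool such as LattE would be needed to obtain $E^*(j)$.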
The numerical computation of the formula of the Ehrhart polynomial $E^*(j)$ can be
done efficiently [15], allowing us to quantify $\Delta_\varepsilon$ and the theoretical minimum
cardinality $M_\varepsilon$ accurately. We present a brief study on the relation
between $M_\varepsilon$ and $\varepsilon$ for some polytopes, including:




(p.i) 

(P.2) 

(P.3) 

(P.4) 

(P.5) 

(P.6) 


Hi') = (^ = 4 ), 

b{iy) = V 1 + V 2 + 2v3 + 4v4 (N = a), 

Hi') = T,Li’'i (^ = 8 ), 

Hi') = Eh ^{N = 8 ), 

h{u) = sup { 4 X;Li I'i^ ^ Eh : 1 < J < s} (A^ = 8), 

Hi') = sup 14 Y.h i'^^ I Eh : 1 < J < sj {N = 8). 


(P.1)-(P.4) correspond to 4- and 8-simplices with different levels of anisotropy.
The lengths of the edges connecting the origin and the other vertices are equal for (P.1)
and (P.3) and vary slightly for (P.2), while (P.4) is quite a skinny simplex. On
the other hand, (P.5) is a truncated, enlarged version of (P.3) where the vertices 
are at 4 of the way along the axis edges and | along other edges, resulting in 
a polytope with 65 vertices. (P.6) in turn is obtained through an enlargement 
of (P.5). We note that (P.2) and (P.3) correspond to the coefficient bounds 
illustrated in Section 5.3. 


Figure 4 shows the variation of $M_\varepsilon$ and $M'_\varepsilon$, as well as the rate adjusting
parameter $1/(1 + \varepsilon)^{1/N}$ in the estimates (4.19) and (4.24), with respect to
$\varepsilon$. The other parameter $C_u(\varepsilon)$ is negligible except for $\varepsilon$ very large and is not
plotted here. The formulas of Ehrhart polynomials are calculated using the
software package LattE [5]. First, we observe that choosing a smaller $\varepsilon$ gives
a stronger convergence, yet $M_\varepsilon$ must also be increased. The good news is that
while the best convergence $M \exp\left( -(M/|\mathcal{P}|)^{1/N} \right)$ is realized only as $\varepsilon \to 0$, $\varepsilon$
need not be small to obtain a strong rate, especially in high dimension. For
instance, $\varepsilon = 1.0$ gives the rate $\sim M \exp\left( -0.83\, (M/|\mathcal{P}|)^{1/N} \right)$ with $N = 4$ and
$\sim M \exp\left( -0.92\, (M/|\mathcal{P}|)^{1/N} \right)$ with $N = 8$.

Not surprisingly, $M_\varepsilon$ and $M'_\varepsilon$ are shown to be larger for higher dimension.
For a fixed $N$, the anisotropy of the polytopes significantly impacts $M_\varepsilon$ and
$M'_\varepsilon$: these values are close for (P.3) and (P.5), which possess different shapes
and scales but span equally in coordinate axes, and much larger for (P.4), the
simplex with skinny shape. Generally, increasing $\varepsilon$ alleviates the restrictions on
$M_\varepsilon$ and $M'_\varepsilon$, as this will reduce $\Delta_\varepsilon$. The strategy is, however, ineffective once
(or gi/A_i ) exceeds Ag and dominates (4.23) and (4.25), at which 
point these conditions cannot be relaxed any further. Thus, while $M_\varepsilon$ and $M'_\varepsilon$
are almost unaffected by the scale of polytopes with $\varepsilon$ close to 0, their lower
bounds (imposed by J > gi/h-i — ^) more restrictive for large polytopes; 
in such cases, mild constraints on $M_\varepsilon$ and $M'_\varepsilon$ may be unattainable. This fact is
illustrated by a comparison of two similar polytopes (P.5) and (P.6) in Figure
4: $M_\varepsilon$ and $M'_\varepsilon$ eventually stop decaying in both cases, but the bound is higher
for (P.6), the polytope with larger scale.
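
As a small worked illustration of the rate adjusting parameter discussed above, the sketch below evaluates $(1 + \varepsilon)^{-1/N}$ directly, assuming this is the factor that multiplies $(M/|\mathcal{P}|)^{1/N}$ in the exponent. The function name is hypothetical. For $\varepsilon = 1$ the formula evaluates to approximately 0.84 for $N = 4$ and 0.92 for $N = 8$, close to the factors quoted in the text.

```python
def rate_factor(eps: float, N: int) -> float:
    """Return (1 + eps)^(-1/N), the factor scaling (M/|P|)^(1/N) in the exponent."""
    return (1.0 + eps) ** (-1.0 / N)


if __name__ == "__main__":
    for N in (4, 8):
        # the factor tends to 1 as eps -> 0 and degrades slowly in high dimension
        print(N, [round(rate_factor(eps, N), 3) for eps in (0.1, 0.5, 1.0, 2.0)])
```

The slow dependence on $\varepsilon$ for larger $N$ is what allows a relatively large $\varepsilon$, and hence a smaller minimum cardinality, without losing much of the rate.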

In short, our asymptotic convergence analysis applies to the range $J > N$.
In the next part, we propose an alternative estimate of truncation errors, which
is effective in the pre-asymptotic regime $J \le N$. Let us remark that the actual
condition on $M_\varepsilon$ for (4.19) to hold can be much milder than the theoretical
minimum cardinality posed by Theorem 2 and investigated here, as shown in
Figures 2 and 3.



Fig. 4: The variation of the rate adjusting parameter and theoretical minimum cardinalities
$M_\varepsilon$ and $M'_\varepsilon$ for the upper estimate (4.19) with respect to $\varepsilon$.


6.3 A pre-asymptotic estimate of truncation errors 

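To acquire an estimation of $\sum_{\nu \notin \Lambda_M} e^{-b(\nu)}$ in the pre-asymptotic regime, following
the arguments in Theorem 2, non-asymptotic bounds of $\#(\mathcal{P}_j \cap \mathbb{Z}^N)$ and
$\sum_{j \ge J} j^N e^{-j}$ need to be established. An upper bound of $\#(\mathcal{P}_j \cap \mathbb{Z}^N)$ is derived
in the following lemma.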

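Lemma 7 Let $b : [0, \infty)^N \to \mathbb{R}$ be continuous and satisfy Assumption 4.
Assume that $b(\tau\nu) = \tau b(\nu)$ for all $\tau \in (0, \infty)$, $\nu \in [0, \infty)^N$. For $\tau \in (0, \infty)$,
denote $\mathcal{P}_\tau = \{\nu \in [0, \infty)^N : b(\nu) \le \tau\}$. There follows

$$\#(\mathcal{P}_j \cap \mathbb{Z}^N) \le j^N \cdot \#(\mathcal{P} \cap \mathbb{Z}^N), \quad \forall j \in \mathbb{N},$$

where $\mathcal{P} = \{\nu \in [0, \infty)^N : b(\nu) \le 1\}$ ($= \frac{1}{\tau} \mathcal{P}_\tau$ for all $\tau$).

Proof. Since $b(\tau\nu) = \tau b(\nu)$, we have $\mathcal{P}_\tau = \tau \mathcal{P}$, $\forall \tau > 0$, thus,

$$\#(\mathcal{P}_j \cap \mathbb{Z}^N) = \#(j\mathcal{P} \cap \mathbb{Z}^N) = \#\left( \mathcal{P} \cap \tfrac{1}{j} \mathbb{Z}^N \right), \quad \forall j \in \mathbb{N}.$$

Given $\mu \in \mathcal{P} \cap \tfrac{1}{j} \mathbb{Z}^N$, $\mu$ can be written uniquely in the form

$$\mu = \nu + \sum_{i=1}^{N} \frac{r_i}{j} e_i,$$

where $\nu \in \mathcal{S}$ and $r_i \in \mathbb{Z}$, $0 \le r_i \le j - 1$, $\forall 1 \le i \le N$.

Since $\nu \le \mu$ and $b$ satisfies Assumption 4, it gives $b(\nu) \le b(\mu) \le 1$ and,
consequently, $\nu \in \mathcal{P} \cap \mathbb{Z}^N$. We have

$$\#\left( \mathcal{P} \cap \tfrac{1}{j} \mathbb{Z}^N \right) \le \#\left\{ \nu + \sum_{i=1}^{N} \frac{r_i}{j} e_i : \nu \in \mathcal{P} \cap \mathbb{Z}^N,\ r_i \in \mathbb{Z},\ 0 \le r_i \le j - 1,\ \forall 1 \le i \le N \right\} = j^N \cdot \#(\mathcal{P} \cap \mathbb{Z}^N),$$

as desired. □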
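
The inequality of Lemma 7 can be checked numerically on a small example. The following is a minimal sketch, assuming a weighted linear map $b(\nu) = \sum_i w_i \nu_i$ with positive integer weights (which is monotone and positively homogeneous); the weights $(1, 1, 2, 4)$ mirror (P.2), and the function names are hypothetical. Lattice points are counted by brute-force enumeration, which is only practical for small $N$ and $j$.

```python
from itertools import product


def lattice_count(weights, tau):
    """Count nu in N_0^N with sum_i weights[i] * nu[i] <= tau by enumeration."""
    # each coordinate nu_i can be at most tau // w_i
    ranges = [range(int(tau // w) + 1) for w in weights]
    return sum(
        1
        for nu in product(*ranges)
        if sum(w * n for w, n in zip(weights, nu)) <= tau
    )


def check_lemma7(weights, j_max):
    """Verify #(P_j cap Z^N) <= j^N * #(P cap Z^N) for j = 1, ..., j_max."""
    N = len(weights)
    sigma = lattice_count(weights, 1)  # #(P cap Z^N)
    return all(
        lattice_count(weights, j) <= j**N * sigma for j in range(1, j_max + 1)
    )


if __name__ == "__main__":
    w = (1, 1, 2, 4)
    print([lattice_count(w, j) for j in range(1, 7)])
    print(check_lemma7(w, 6))
```

The bound is attained with equality at $j = 1$ and becomes increasingly conservative as $j$ grows, since $j^N \cdot \#(\mathcal{P} \cap \mathbb{Z}^N)$ overcounts points near the boundary of $\mathcal{P}_j$.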

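Next, we give a non-asymptotic estimate of $\sum_{j \ge J} j^N e^{-j}$ based on a tight
approximation of $\sum_{j=1}^{J-1} j^N e^{-j}$ for $J \le N + 1$. Indeed, since $\tau \mapsto \tau^N e^{-\tau}$ is
increasing in $[0, N]$, we have

$$\sum_{j=1}^{J-1} j^N e^{-j} \ge \int_0^{J-1} \tau^N e^{-\tau}\, d\tau. \qquad (6.14)$$

Applying Theorem 4.1, [31], yields

$$\int_0^{J-1} \tau^N e^{-\tau}\, d\tau \ge \frac{(J-1)^{N+1}}{N+1} \exp\left( -\frac{(J-1)(N+1)}{N+2} \right). \qquad (6.15)$$

Combining (6.14) and (6.15), it gives

$$\sum_{j \ge J} j^N e^{-j} = \sum_{j=1}^{\infty} j^N e^{-j} - \sum_{j=1}^{J-1} j^N e^{-j} \le \sum_{j=1}^{\infty} j^N e^{-j} - \frac{(J-1)^{N+1}}{N+1} \exp\left( -\frac{(J-1)(N+1)}{N+2} \right). \qquad (6.16)$$
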

A mathematical formula of the sum $\sum_{j=1}^{\infty} j^N e^{-j}$ is not accessible. However, it
is independent of $J$ and can be written in terms of the well-studied polylogarithm
functions

$$\mathrm{Li}_s(z) = \sum_{k=1}^{\infty} \frac{z^k}{k^s} \quad \text{for } z \in \mathbb{C},\ |z| < 1,\ s \in \mathbb{R}, \qquad (6.17)$$

see [26,27]. Combining (6.16) and (6.17), we have proved the following lemma:


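Lemma 8 For any $N, J \in \mathbb{N}$, if $J \le N + 1$, it gives

$$\sum_{j \ge J} j^N e^{-j} \le \mathrm{Li}_{-N}(1/e) - \frac{(J-1)^{N+1}}{N+1} \exp\left( -\frac{(J-1)(N+1)}{N+2} \right). \qquad (6.18)$$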
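
The bound (6.18) is straightforward to evaluate numerically. The following is a minimal sketch, assuming the polylogarithm $\mathrm{Li}_{-N}(1/e) = \sum_{k \ge 1} k^N e^{-k}$ is approximated by truncating its defining series (the terms decay exponentially, so a few thousand terms are far more than enough for moderate $N$). The function names and the truncation length are illustrative choices, not taken from the paper.

```python
import math


def term(j: int, N: int) -> float:
    """j^N * e^{-j}, computed in log space to avoid overflow for large N."""
    return math.exp(N * math.log(j) - j)


def polylog_neg_int(N: int, terms: int = 5000) -> float:
    """Truncated series for Li_{-N}(1/e) = sum_{k>=1} k^N e^{-k}."""
    return math.fsum(term(k, N) for k in range(1, terms + 1))


def tail_sum(N: int, J: int, terms: int = 5000) -> float:
    """Reference value of sum_{j>=J} j^N e^{-j} (truncated)."""
    return math.fsum(term(j, N) for j in range(J, terms + 1))


def lemma8_bound(N: int, J: int) -> float:
    """Right-hand side of (6.18); intended for 1 <= J <= N + 1."""
    x = J - 1
    correction = x ** (N + 1) / (N + 1) * math.exp(-x * (N + 1) / (N + 2))
    return polylog_neg_int(N) - correction


if __name__ == "__main__":
    N = 20
    for J in (1, 5, 10, 15, 21):
        print(J, tail_sum(N, J), lemma8_bound(N, J))
```

A small relative tolerance is advisable when comparing the two quantities in floating point, since for $J = 1$ the bound coincides with the tail sum.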
In Figure 5, we compare the performance of the asymptotic bound (4.16)
and the pre-asymptotic bound (6.18) in estimating the truncation error of
$\sum_{j \ge J} j^N e^{-j}$ for $N = 20$. The pre-asymptotic estimate shows an excellent
agreement with the true value for small $J$; however, it cannot capture the error
decay when $J$ is big. The asymptotic bound, on the other hand, successfully
predicts the convergence rate of $\sum_{j \ge J} j^N e^{-j}$, but it is not effective for small
$J$.


Fig. 5: A comparison of the asymptotic bound (4.16) and the pre-asymptotic bound (6.18)
in estimating $\sum_{j \ge J} j^N e^{-j}$ for $N = 20$.




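We are now in the position to prove a pre-asymptotic estimation of
$\sum_{\nu \notin \Lambda_M} e^{-b(\nu)}$.

Theorem 4 Consider the multi-indexed series $\sum_{\nu} e^{-b(\nu)}$ with $b(\nu)$ being
continuous and satisfying Assumption 4. Assume that $b(\tau\nu) = \tau b(\nu)$ for all
$\tau \in (0, \infty)$, $\nu \in [0, \infty)^N$. For $\tau \in (0, \infty)$, denote $\mathcal{P}_\tau = \{\nu \in [0, \infty)^N : b(\nu) \le \tau\}$
and $\Lambda_M$ the set of indices corresponding to the $M$ largest $e^{-b(\nu)}$. Define $\mathcal{P}$ as in
(4.12).

For $M \in \mathbb{N}$, if $M \le \#(\mathcal{P}_N \cap \mathbb{Z}^N)$, there holds

$$\sum_{\nu \notin \Lambda_M} e^{-b(\nu)} \le e\sigma \left[ \mathrm{Li}_{-N}(1/e) - \frac{1}{N+1} \left( \frac{M}{\sigma} \right)^{1 + 1/N} \exp\left( -\frac{M^{1/N}(N+1)}{\sigma^{1/N}(N+2)} \right) \right]. \qquad (6.19)$$

Here, $\sigma = \#(\mathcal{P} \cap \mathbb{Z}^N)$ and Li denotes the polylogarithm function.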
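Proof. Applying Lemma 7, it gives

$$\#(\mathcal{P}_j \cap \mathbb{Z}^N) \le j^N\, \#(\mathcal{P} \cap \mathbb{Z}^N), \quad \forall j \in \mathbb{N}. \qquad (6.20)$$

To estimate $\sum_{\nu \notin \Lambda_M} e^{-b(\nu)}$, it is sufficient to consider this sum with $\Lambda_M =
\mathcal{P}_J \cap \mathbb{Z}^N$, $J \in \mathbb{N}$, $J \le N$. We have

$$\begin{aligned}
\sum_{\nu \notin \mathcal{P}_J \cap \mathbb{Z}^N} e^{-b(\nu)} &\le \sum_{j \ge J} \left( \#(\mathcal{P}_{j+1} \cap \mathbb{Z}^N) - \#(\mathcal{P}_j \cap \mathbb{Z}^N) \right) e^{-j} \\
&\le \#(\mathcal{P} \cap \mathbb{Z}^N) \sum_{j \ge J} (j+1)^N e^{-j} \\
&\le e \cdot \#(\mathcal{P} \cap \mathbb{Z}^N) \left[ \mathrm{Li}_{-N}(1/e) - \frac{J^{N+1}}{N+1} \exp\left( -\frac{J(N+1)}{N+2} \right) \right],
\end{aligned}$$

by applying Lemma 8.

From (6.20), it gives $J \ge (M/\sigma)^{1/N}$. The mapping
$j \mapsto \frac{j^{N+1}}{N+1} \exp\left( -\frac{j(N+1)}{N+2} \right)$ is increasing in $[0, N]$. There follows

$$\sum_{\nu \notin \Lambda_M} e^{-b(\nu)} \le e\sigma \left[ \mathrm{Li}_{-N}(1/e) - \frac{1}{N+1} \left( \frac{M}{\sigma} \right)^{1 + 1/N} \exp\left( -\frac{M^{1/N}(N+1)}{\sigma^{1/N}(N+2)} \right) \right],$$

where $\sigma = \#(\mathcal{P} \cap \mathbb{Z}^N)$, implying the assertion (6.19).

□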


Remark 3 The pre-asymptotic analysis presented above does not employ Ehrhart
polynomials, hence applies to a wider class of $b$ than those given by (6.1).

In this subsection, $b$ only needs to be continuous, satisfy Assumption 4 and
$b(\tau\nu) = \tau b(\nu)$ for all $\tau \in (0, \infty)$, $\nu \in [0, \infty)^N$.
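
To make the pre-asymptotic estimate concrete, the sketch below evaluates the right-hand side of (6.19) as read above and compares it with the exact truncation error in one simple setting. It assumes the isotropic map $b(\nu) = \sum_i \nu_i$, for which $\sigma = N + 1$, $\#(\mathcal{P}_J \cap \mathbb{Z}^N) = \binom{N+J}{N}$, and the number of indices with $\sum_i \nu_i = k$ is $\binom{k+N-1}{N-1}$. The function names and the series truncation length are illustrative assumptions.

```python
import math


def polylog_neg_int(N: int, terms: int = 5000) -> float:
    """Truncated series for Li_{-N}(1/e) = sum_{k>=1} k^N e^{-k}."""
    return math.fsum(math.exp(N * math.log(k) - k) for k in range(1, terms + 1))


def preasymptotic_bound(M: int, N: int, sigma: int) -> float:
    """Right-hand side of (6.19) for cardinality M, dimension N, sigma = #(P cap Z^N)."""
    r = (M / sigma) ** (1.0 / N)  # (M / sigma)^(1/N)
    correction = r ** (N + 1) / (N + 1) * math.exp(-r * (N + 1) / (N + 2))
    return math.e * sigma * (polylog_neg_int(N) - correction)


def exact_truncation_error(N: int, J: int, terms: int = 2000) -> float:
    """sum of e^{-|nu|_1} over nu in N_0^N with |nu|_1 > J (isotropic case only)."""
    return math.fsum(
        math.comb(k + N - 1, N - 1) * math.exp(-k) for k in range(J + 1, terms + 1)
    )


if __name__ == "__main__":
    N = 4
    sigma = N + 1
    for J in range(1, N + 1):
        M = math.comb(N + J, N)  # cardinality of P_J cap Z^N
        print(J, M, exact_truncation_error(N, J), preasymptotic_bound(M, N, sigma))
```

For other homogeneous, monotone maps $b$, only `sigma` and the reference error computation would need to change; the bound itself depends on $b$ solely through $\sigma$.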


7 Concluding remarks 

We present a new approach for analyzing the convergence of quasi-optimal
Taylor and Legendre approximations for parameterized PDEs with
finite-dimensional deterministic and stochastic coefficients. The analysis also gives
accessible estimates for best M-term approximation errors without
employing the Stechkin inequality. The advantages of our framework, which are
demonstrated through several theoretical examples herein, include its applicability
to a general class of coefficient decay and the sharp estimates of asymptotic
errors. This work is restricted to linear elliptic equations with input
coefficients depending affinely on the parameter. We expect similar results to hold
in different settings with finite parametric dimension, particularly nonlinear
elliptic PDEs, initial value problems and parabolic equations [12,22,23,25], as
our analysis only depends on the polynomial coefficient estimates.

Developing algorithms for identifying quasi-optimal subspaces is the next
natural and essential step. Two potential types of procedures for building the
subspaces corresponding to sharp estimates of the coefficients $c_\nu$ are a
priori and a posteriori approaches. In the first approach, the estimates for $c_\nu$
are derived a priori using knowledge of the input coefficient $a(x, y)$. Analytical
studies reveal that if the complex continuation of $a(x, y)$ is an analytic
function in $\mathbb{C}^N$, then a theoretical decaying rate $\rho^{-\nu}$ of $c_\nu$ (with $\rho = (\rho_i)_{1 \le i \le N}$
representing the size of certain $N$-dimensional complex domains where the real
part of $a(x, y)$ is bounded away from 0) can be proved. The exploration of
polynomial subspaces thus reduces to the specification of such domains (or $\rho$
in particular), which is expected to be significantly less computationally
demanding. A recent study [6] of a priori constructed Total Degree subspaces found
that while the theoretical estimates were not sharp, they could still provide
a good prediction of the anisotropy of the index sets. However, in practice,
most analytical coefficient bounds lead to subspaces much more complicated
than Total Degree, and the determination of $\rho$ in several cases is nontrivial,
see [6,12-14]. It is important to develop, implement and test the effectiveness
of a priori algorithms in such settings.

Research on a posteriori procedures may be pursued in three directions.
The first strategy finds the quasi-optimal index set using the theoretical
coefficient estimates, but with $\rho$ determined sharply in an a posteriori manner (by
exact calculation of the decaying rate of $c_\nu$ in each direction $i$, $i = 1, \ldots, N$),
instead of a priori (by the definition of $a(x, y)$ as above). The second
strategy adaptively builds a nested sequence $\{\Lambda_m\}_{m \ge 0}$ of quasi-optimal index sets
$\Lambda_m$ at a cost that scales linearly in $\#(\Lambda_m)$. Given $\Lambda_m$, we construct $\Lambda_{m+1}$ by
enriching $\Lambda_m$ with the most effective indices $\nu$ in its neighborhood (denoted
by $\mathcal{M}(\Lambda_m)$), which results in the best residual reduction. The third strategy
first evaluates $u(y)$ on a certain finite subset of $\Gamma$ and then constructs the
quasi-optimal subspace based on estimates of the coefficients $c_\nu = \int_\Gamma u(y) L_\nu(y)\, dy$ using
non-intrusive methods, e.g., Monte Carlo or collocation. We expect the
exploration cost for this approach, mostly coming from the evaluation of $u(y)$, to
be a fraction of the cost of computing the solution.
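
The second strategy can be sketched in a few lines. The following is a minimal illustration only, not the algorithm proposed in the paper: it assumes that the benefit of adding an index $\nu$ is measured by a known surrogate $e^{-b(\nu)}$ with a weighted linear $b$ (in practice this would be replaced by a residual-based indicator), and that the neighborhood of $\Lambda_m$ consists of the forward neighbors whose backward neighbors all lie in $\Lambda_m$, so that every set in the sequence is downward closed. All names are hypothetical.

```python
import heapq


def backward_neighbours(nu):
    """All indices obtained from nu by decreasing one positive coordinate by 1."""
    return [nu[:k] + (nu[k] - 1,) + nu[k + 1:] for k in range(len(nu)) if nu[k] > 0]


def greedy_index_sets(weights, M):
    """Return the first M indices in greedy order for the surrogate exp(-b(nu)),
    b(nu) = sum_i weights[i] * nu[i]. Prefixes of the list are nested index sets."""
    N = len(weights)
    b = lambda nu: sum(w * n for w, n in zip(weights, nu))
    origin = (0,) * N
    selected, in_set = [], set()
    heap, queued = [(b(origin), origin)], {origin}
    while heap and len(selected) < M:
        _, nu = heapq.heappop(heap)  # smallest b, i.e. largest exp(-b)
        selected.append(nu)
        in_set.add(nu)
        for i in range(N):
            mu = nu[:i] + (nu[i] + 1,) + nu[i + 1:]
            # admissible only if all backward neighbours are already selected
            if mu not in queued and all(p in in_set for p in backward_neighbours(mu)):
                heapq.heappush(heap, (b(mu), mu))
                queued.add(mu)
    return selected


if __name__ == "__main__":
    print(greedy_index_sets((1, 2), 6))
```

Each step touches only the $N$ forward neighbors of the newly added index, so the bookkeeping grows roughly linearly with the size of the index set, up to the logarithmic cost of the heap.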

Finally, the development of quasi-optimal methods for another class of
polynomial approximation, namely non-intrusive interpolation or collocation methods,
is an important problem to study. These methods are practical and convenient
in that they allow the use of legacy, black-box deterministic numerical solvers,
and the simultaneous approximation of parameterized solutions can be
considered as a modular post-processing step. Observing that the accuracy
of the interpolation operator $\mathcal{I}_{\Lambda_M}$ is dictated by the inequality

$$\| u - \mathcal{I}_{\Lambda_M}[u] \|_{L^\infty(\Gamma)} \le (1 + \mathbb{L}_{\Lambda_M}) \inf_{v \in \mathbb{P}_{\Lambda_M}(\Gamma)} \| u - v \|_{L^\infty(\Gamma)} = (1 + \mathbb{L}_{\Lambda_M}) \| u - u_{\Lambda_M} \|_{L^\infty(\Gamma)},$$

where $\mathbb{L}_{\Lambda_M}$ denotes the Lebesgue constant, we expect that the interpolation
schemes in the quasi-optimal subspaces recover the convergence rates described
in this work. However, to construct a non-intrusive hierarchical interpolant,
two difficult challenges need to be addressed. First, the number of interpolation
points needs to remain equal to the dimension of the polynomial space; thus,
they must be nested and increase linearly. Second, to guarantee the accuracy
of $\mathcal{I}_{\Lambda_M}[u]$, the Lebesgue constant must grow slowly with respect to the total
number of collocation points, and we will need to explore the selections of
abscissas which optimize this growth.